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PREFACE 


In  the  present  volume  we  have  tried  to  present  in  an  elementary  and 
formal  manner  the  mathematical  theory  of  games  of  strategy  and  some  of 
its  applications.  Although  many  of  the  applications  are  discussed  in  mili¬ 
tary  terms,  they  can  easily  be  formulated  in  economic  or  social  science 
terms. 

An  attempt  has  been  made  to  develop  the  subject  matter  in  such  a 
way  as  to  make  the  volume  adaptable  as  a  text  on  the  theory  of  games  in 
colleges  and  universities.  The  analytical  level  of  the  book  is  such  that  one 
year  of  calculus  can  serve  as  a  prerequisite  for  the  course. 

The  book  starts  in  Chapter  1  with  an  exposition  of  games  of  strategy, 
with  examples  taken  from  parlor  games  as  well  as  from  military  games. 
The  next  two  chapters  treat  the  basic  topics  in  the  theory  of  finite  games, 
i.e.,  the  existence  of  optimal  strategies  and  their  properties.  Chapters  4 
and  5  deal  with  the  representation  of  games  and  the  computation  of  op¬ 
timal  strategies.  Since  many  games  involve  an  infinite  number  of  strate¬ 
gies,  Chapters  6,  7,  and  8  deal  with  such  games  by  developing  the  necessary 
mathematics  (e.g.,  probability  distribution  functions  and  Stieltjes  inte¬ 
grals)  for  handling  infinite  games.  The  results  on  infinite  games  are  applied 
in  Chapters  9  and  10  to  two  general  classes  of  games — timing  games  and 
tactical  games.  Finally,  the  last  chapter  provides  an  application  of  mo¬ 
ment  space  theory  to  the  solution  of  infinite  games. 

Many  of  the  chapters  are  independent  of  one  another;  for  example,  the 
chapter  on  games  in  extensive  form  may  be  introduced  after  Chapter  1, 
while  the  last  four  chapters  may  be  studied  in  any  order  after  Chapter  7. 

This  study  was  undertaken  by  The  RAND  Corporation  as  a  part  of 
its  research  program  for  the  United  States  Air  Force. 
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GAME,  STRATEGY, 
AND  SADDLE-POINT 


t.  INTRODUCTION 

The  theory  of  games  of  strategy  may  be  described  as  a  mathematical 
theory  of  decision-making  by  participants  in  a  competitive  environment. 
In  a  typical  problem  to  which  the  theory  is  applicable,  each  participant 
can  bring  some  influence  to  bear  upon  the  outcome  of  a  certain  event;  no 
single  participant  by  himself  nor  chance  alone  can  determine  the  out¬ 
come  completely.  The  theory  is  then  concerned  with  the  problem  of  choos¬ 
ing  an  optimal  course  of  action  which  takes  into  account  the  possible 
actions  of  the  participants  and  the  chance  events. 

Examples  of  games  of  strategy  are  parlor  games  such  as  poker,  chess, 
and  bridge,  military  games  such  as  the  defense  of  targets  against  attack, 
and  economic  games  such  as  the  price  competition  between  two  sellers. 
Each  of  these  games  of  strategy  allows  the  players  to  make  use  of  their 
ingenuity  in  order  to  influence  the  outcome.  Although  some  of  these  games 
involve  elements  of  chance  (e.g.,  the  cards  dealt  in  poker  and  the  hit  of  a 
target) ,  we  shall  exclude  from  our  discussion  games  of  chance  such  as  black¬ 
jack  and  craps  since  their  outcomes  depend  entirely  upon  chance  and 
cannot  be  influenced  by  the  decisions  of  the  players. 

Games  of  chance  have  been  studied  mathematically  for  many  years; 
the  mathematical  theory  of  probability  was  developed  from  their  study. 
Although  strategic  situations  have  long  been  observed  and  recorded,  the 
first  attempt  to  abstract  them  into  a  mathematical  theory  of  strategy  was 
made  in  1921  by  Emile  Borel.  The  theory  was  firmly  established  by  John 
von  Neumann  in  1928  when  he  proved  the  minimax  theorem,  the  funda¬ 
mental  theorem  of  games  of  strategy.  However,  it  was  not  until  the  pub¬ 
lication  in  1944  of  the  impressive  work,  Theory  of  Games  and  Economic 
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Behavior,  by  John  von  Neumann  and  Oskar  Morgenstern,  that  the  mathe¬ 
matical  theory  of  games  received  much  attention.  This  book  emphasized  a 
new  approach  to  the  general  problem  of  competitive  behavior  through  a 
study  of  games  of  strategy.  Since  then  the  theory  has  been  applied  to 
game-like  problems — not  only  in  economics,  but  also  in  the  military  and 
politics. 

2.  DESCRIPTION  OF  A  GAME  OF  STRATEGY 

A  game  of  strategy  is  described  by  its  set  of  rules.  These  rules  specify 
clearly  what  each  person  called  a  'player  is  allowed  or  required  to  do  under 
all  possible  circumstances.  The  rules  define  the  amount  of  information,  if 
any,  each  player  receives.  If  the  game  requires  the  use  of  chance  devices, 
the  rules  describe  how  the  chance  events  shall  be  interpreted.  They  also 
define  the  time  the  game  ends,  the  amount  each  player  pays  or  receives,  and 
the  objective  of  each  player. 

From  the  rules  we  can  obtain  such  general  properties  of  the  game  as 
the  number  of  moves,  the  number  of  players,  and  the  payoff.  The  game 
is  finite  if  each  player  has  a  finite  number  of  moves  and  a  finite  number  of 
choices  available  at  each  move.  It  is  convenient  to  classify  games  according 
to  the  number  of  players,  i.e.,  as  2-person,  3-person,  etc.  It  is  also  conven¬ 
ient  to  distinguish  between  games  whose  payoffs  are  zero-sum  and  those 
whose  are  not.  If  the  players  make  payments  only  to  each  other,  the  game 
is  said  to  be  zero-sum. 

We  shall  first  analyze  the  2-person,  zero-sum,  finite  game — most  parlor 
games  and  many  military  games  are  of  this  type.  It  is  clear  that  in  such  a 
game  the  winnings  of  one  player  are  the  losses  of  the  other  player. 

To  simplify  the  mathematical  description  of  a  game,  we  introduce  the 
concept  of  a  “strategy.”  In  the  actual  play  of  a  game,  instead  of  making  his 
decision  at  each  move  each  player  may  formulate  in  advance  of  the  play 
a  plan  for  playing  the  game  from  beginning  to  end.  Such  a  plan  must  be 
complete  and  cover  all  possible  contingencies  that  may  arise  in  the  play. 
Among  other  things,  this  plan  would  incorporate  any  information  which 
may  become  available  to  the  player  in  accordance  with  the  rules  of  the 
game.  Such  a  complete  prescription  for  a  play  of  a  game  by  the  player  is 
called  a  strategy  of  that  player.  A  player  using  a  strategy  loses  no  freedom 
of  action  since  the  strategy  specifies  the  player’s  actions  in  terms  of  the 
information  that  might  become  available. 

We  may  think  of  a  strategy  of  a  player  as  a  set  of  instructions  for  play¬ 
ing  the  given  game  from  the  first  move  to  the  last.  Conversely,  each  dif¬ 
ferent  way  that  a  player  may  play  a  game  is  a  strategy  of  that  player.  If 
we  enumerate  all  the  different  ways  for  a  player  to  play  the  game,  we  ob¬ 
tain  all  the  strategies  of  that  player.  Of  course,  many  of  these  strategies 
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may  be  obviously  poor  ones,  but  they  are  included  in  the  initial 
enumeration. 

Every  pair  of  strategies,  consisting  of  one  strategy  for  each  player, 
determines  a  play  of  the  game,  which  in  turn  determines  a  payoff  to  each 
player.  Thus  we  may  consider  a  play  of  a  game  to  consist  of  each  player’s 
making  one  decision,  namely  the  selection  of  a  strategy. 

In  terms  of  this  notion  of  strategy,  we  can  now  describe,  corresponding 
to  any  given  game,  an  equivalent  game  of  much  simpler  character.  Let  us 
call  the  two  players  Blue  and  Red.  Suppose  Blue  has  m  strategies,  which 
may  be  designated  by  the  numbers 

i  =  1,  2,  . .  . ,  m. 

Suppose  Red  has  n  strategies,  which  we  may  designate  by 

j  =  1,2,  ...  ,n. 

Then  on  the  first  move  Blue  chooses  some  strategy  i.  On  the  next  move, 
Red,  without  being  informed  what  choice  Blue  has  made,  chooses  strategy 
j.  These  two  choices  determine  a  play  of  the  game  and  a  payoff  to  the  two 
players.  Let  ay  be  the  payoff  to  Blue.  Since  the  game  is  zero-sum,  the 
payoff  to  Red  is  -a,,-. 

The  game  is  thus  determined  by  Blue’s  payoff  matrix, 


an 

an  ■  •  • 

^1  n 

a2i 

an  ... 

«2n 

_  a,ni 

a,«2  •  •  • 

_ 

In  this  matrix  each  Blue  strategy  is  represented  by  a  row;  each  Red  strategy 
is  represented  by  a  column.  If  Blue  chooses  the  ith  strategy  or  ith  row  and 
Red  chooses  the  jth  strategy  or  jth  column,  then  Red  is  to  pay  Blue  the 
amount  a y.  Blue  wants  ay  to  be  as  large  as  possible,  but  he  controls  only 
the  choice  of  his  strategy  i.  Red  wants  ay  to  be  as  small  as  possible  but  he 
controls  only  the  choice  of  his  strategy  j.  What  are  the  guiding  principles 
which  should  determine  the  choices  and  what  is  the  expected  outcome 
of  the  game? 

3.  ILLUSTRATIVE  EXAMPLES 

Example  1.  Function  of  a  Field  Commander. 

Military  action  by  a  commander  against  an  enemy  requires  him  to 
evaluate  a  situation  and  make  a  decision.  The  evaluation  is  made  by  con¬ 
sidering  the  given  mission,  the  capabilities  of  the  enemy,  and  the  possible 
courses  of  action  available  to  both  sides,  and  by  comparing  the  likely  con¬ 
sequences  of  one’s  own  courses  of  action.  These  are  the  rules  of  the  game. 
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From  his  possible  courses  of  action,  the  commander  selects  an  optimal 
one — a  course  which  promises  to  be  most  successful  in  accomplishing  the 
mission. 

The  military  doctrine  covering  the  above  evaluation  is  discussed  in 
detail  in  the  Naval  Manual  of  Operational  Planning  of  the  Naval  War 
College.  The  Manual  states,  “Each  of  our  own  courses  of  action  ...  is 
separately  weighed  in  turn  against  each  capability  of  the  enemy  which  may 
interfere  with  the  accomplishment  of  the  mission.  The  results  to  be  ex¬ 
pected  in  each  case  are  visualized.  The  advantages  and  disadvantages 
noted  as  a  result  of  the  analysis  for  each  of  our  own  courses  of  action  are 
summarized  and  the  various  courses  of  action  are  compared  and  weighed.” 
A  commander  thus  enumerates  the  possible  opposing  courses  of  action  and 
their  outcomes.  Each  course  of  action  is  a  strategy  for  the  commander  and 
each  outcome  yields  a  payoff. 

Suppose  that  the  mission  of  the  Blue  forces  is  to  capture  an  objective 
defended  by  the  Red  forces.  The  Blue  commander  analyzes  the  possible 
courses  of  action  within  the  capabilities  of  the  Red  forces  which  can  affect 
the  capture  of  the  objective.  Then  Blue  lists  the  various  ways  of  capturing 
the  objective.  Let  us  assume  that  this  yields  three  possible  courses  of 
action,  n,  r2,  r3,  to  the  Red  commander  and  three  possible  courses  of  action, 
bi,  bi,  bs,  to  the  Blue  commander.  There  are  then  nine  possible  outcomes 
which  may  be  summarized  by  a  3  X  3  matrix.  For  example,  the  matrix 
may  have  the  following  descriptive  entries: 

“Estimate  op  the  Situation”  by  Blue  Commander 
Red  courses  of  action 


n 

r2 

r3 

h 

"Fail 

Succeed 

Succeed 

Blue  courses 
of  action 

b2 

Draw 

Succeed 

Draw 

Fail 

b. 

Succeed 

Draw 

In  this  illustrative  situation,  the  Blue  commander  estimates  that  if  he 
uses  strategy  b i,  for  example,  and  the  Red  commander  uses  n,  Blue  will 
fail  to  capture  the  objective.  However,  Blue  will  succeed  if  he  uses  strat¬ 
egy  bi  and  the  Red  commander  uses  strategy  r2  or  r3.  Similarly,  the  remain¬ 
ing  six  outcomes  are  the  Blue  commander’s  evaluation  of  the  outcomes 
with  respect  to  various  courses  of  actions. 

The  problem  of  the  Blue  commander  is  to  select  the  best  course  of 
action,  or  the  strategy  which  will  secure  for  him  the  most  likely  chance 
of  capturing  the  objective.  The  Red  commander  will  likewise  try  to  select 
his  best  course  of  action,  or  that  strategy  which  will  minimize  Blue’s  chance 
of  capturing  the  objective.  We  thus  have  the  following  problem  in  com¬ 
petitive  behavior:  Blue  wishes  to  maximize  the  outcome  by  a  proper 
choice  of  his  strategy,  and  Red  wishes  to  minimize  this  same  outcome,  also 
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by  a  proper  choice  of  Red’s  strategy,  where  the  outcome  depends  upon 
both  Blue’s  and  Red’s  choices. 

The  established  doctrine  of  the  “Estimate  of  the  Situation”  dictates 
the  selection  of  the  course  of  action  wrhich  promises  to  be  most  successful 
in  the  accomplishment  of  the  mission  regardless  of  the  course  selected  by 
the  Red  commander.  Accordingly,  Blue  would  choose,  in  this  illustrative 
example,  strategy  b2  which  yields  either  a  successful  outcome  or  a  draw. 
For  if  the  Blue  commander  uses  a  strategy  bi  or  b3  he  may  fail  to  accom¬ 
plish  the  mission.  Strategy  b2  represents  the  most  conservative  decision  of 
the  Blue  commander.  It  is  equivalent  to  assuming  that  the  Red  commander 
can  find  out  the  decision  of  the  Blue  commander.  In  a  later  section  we 
shall  see  how  the  Blue  commander  can,  by  taking  chances,  or  bluffing,  ob¬ 
tain  on  the  average  a  more  advantageous  outcome.  He  accomplishes  this 
by  mixing  his  strategies,  or  by  choosing  his  strategy  with  a  chance  device. 

Example  2.  The  Game  “Morra.” 

This  is  a  game  which  has  only  personal  moves.  Each  player  shows  one, 
two,  or  three  fingers  and  simultaneously  calls  his  guess  of  the  number  of 
fingers  his  opponent  will  show.  If  just  one  player  guesses  correctly,  he  wins 
an  amount  equal  to  the  sum  of  the  fingers  shown  by  himself  and  his 
opponent;  otherwise  the  game  is  a  draw. 

This  game  consists  of  one  move  for  each  player — the  choice  of  a  number 
to  show  and  a  number  to  guess.  We  may  represent  a  strategy  for  each 
player  by  a  pair  of  numbers  (s,  g),  where  s  =  1,  2,  3  is  the  number  of  fingers 
he  shows  and  g  —  1,  2,  3  is  his  guess  of  the  number  of  fingers  his  opponent 
will  show.  It  is  evident  that  each  player  has  nine  strategies,  and  thus  there 
are  81  different  possible  plays  of  the  game.  With  each  of  these  81  ways  of 
playing  the  game,  there  is  associated  a  payment  to  the  players,  as  described 
by  the  rules  of  the  game.  These  payments  are  summarized  by  a  payoff 
matrix.  In  the  following  payoff  matrix,  the  entries  represent  payments 
to  Blue.  Red  will  receive  the  negative  of  these  payments. 

Morra  Payoff 


Red  strategies 


(1,  1) 

(1,  2) 

(1,  3) 

(2, 1) 

(2,  2) 

(2,3) 

(3,  1) 

(3,  2) 

(3,  3) 

(1, 1) 

0 

2 

2 

-3 

0 

0 

-4 

0 

0 

(b  2) 

-2 

0 

0 

0 

3 

3 

-4 

0 

0 

(1,  3) 

-2 

0 

0 

-3 

0 

0 

0 

4 

4 

(2,  1) 

3 

0 

3 

0 

-4 

0 

0 

-5 

0 

Blue  (2,  2) 

0 

-3 

0 

4 

0 

4 

0 

-5 

0 

strategies  (2,  3) 

0 

-3 

0 

0 

-4 

0 

5 

0 

5 

(3,  1) 

4 

4 

0 

0 

0 

-5 

0 

0 

-6 

(3,  2) 

0 

0 

-4 

5 

5 

0 

0 

0 

-6 

(3,  3) 

0 

0 

-4 

0 

0 

-5 

6 

6 

0 
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Example  3.  The  Game  “le  Her.” 

This  game  involves  both  chance  and  personal  moves.  Although  the 
game  can  be  played  by  several  people,  we  shall  describe  it  for  two  players. 
From  an  ordinary  deck  of  cards,  a  dealer  gives  a  card  at  random  to  a  receiver 
and  takes  one  himself ;  neither  player  sees  the  other’s  card.  The  main  object 
is  for  each  to  obtain  a  higher  card  than  his  opponent.  The  order  of  value 
is  ace,  two,  three,  .  .  .  ,  ten,  jack,  queen,  king.  Now  if  the  receiver  is  not 
content  with  his  card,  he  may  compel  the  dealer  to  change  with  him;  but 
if  the  dealer  has  a  king,  he  is  allowed  to  retain  it.  If  the  dealer  is  not  content 
with  the  card  which  he  at  first  obtained,  or  which  he  has  been  compelled 
to  take  from  the  receiver,  he  is  allowed  to  change  it  for  another  taken  out 
of  the  deck  at  random;  but  if  the  card  he  then  draws  is  a  king,  he  is  not 
allowed  to  have  it,  but  must  keep  the  card  which  he  held  after  the  receiver 
exercised  his  option.  The  two  players  then  compare  cards,  and  the  player 
with  the  higher  card  wins.  If  the  dealer  and  receiver  have  cards  of  the  same 
value,  the  dealer  wins. 

This  game  consists  of  three  moves:  The  first  move  is  a  chance  move, 
one  card  each  being  dealt  at  random  to  the  receiver  and  dealer.  The  second 
move  is  a  personal  move  by  the  receiver,  who  exchanges  his  card  with  the 
the  dealer  or  stays  with  the  original  card.  The  third  move  is  a  personal 
move  by  the  dealer,  who  exchanges  his  card  with  a  card  from  the  deck  or 
stays  with  the  card  he  holds. 

The  game  can  be  summarized  by  defining  a  strategy  for  the  receiver 
to  be  a  determination  of  change  or  stay  for  each  of  the  thirteen  cards  he 
may  receive.  One  such  strategy  might  be 

123456789  10  JQK 
[CSCSSCCSSC  SCS] 

where  “C”  means  change  and  “S”  means  stay.  Thus,  the  above  strategy 
tells  the  receiver  to  change  if  he  receives  an  ace,  stay  if  he  receives  a  two, 
change  if  he  receives  a  three,  .  .  .  ,  change  if  he  receives  a  queen,  stay  if 
he  receives  a  king.  Note  that  the  strategy  is  a  complete  set  of  instructions. 
It  is  apparent  that  the  receiver  has  218  strategies.  Of  course,  most  of  them 
are  poor  strategies  and  should  not  be  played,  as  will  be  described  in  a 
subsequent  section. 

Since  the  dealer’s  personal  move,  the  third  move  of  the  game,  is  made 
after  the  receiver  has  exercised  his  option,  the  dealer  may  then  have  infor¬ 
mation  about  the  receiver’s  card.  Therefore  a  strategy  for  the  dealer  must 
include  this  information.  However,  to  simplify  the  enumeration  of  the 
dealer’s  strategy,  let  us  make  the  following  two  observations  about  rational 
behavior  of  the  dealer: 
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(i)  If  the  receiver  exchanges  cards  with  the  dealer  and  the  dealer 
thereby  obtains  a  lower  card  than  the  receiver,  then  the  dealer 
should  exchange  with  the  deck. 

(ii)  If  the  receiver  exchanges  cards  with  the  dealer  and  the  dealer 
thereby  obtains  a  card  at  least  as  high  as  the  receiver’s,  then  the 
dealer  should  stay  with  that  card. 

Let  us  refer  to  these  two  instructions  of  rational  behavior  as  R.  They 
have  the  effect  of  eliminating  obviously  poor  strategies.  Now  the  strat¬ 
egies  of  the  dealer  may  be  enumerated  in  a  manner  similar  to  those  of  the 
receiver  and  at  the  same  time  incorporate  the  information  about  the  re¬ 
ceiver.  A  strategy  for  the  receiver  may  be  written  as  follows: 

(R;  S  C  C  S  .  .  .  S  C). 

This  strategy  tells  the  dealer  to  do  the  following:  if  the  receiver  exchanges 
cards,  the  dealer  follows  the  two  instructions  R  of  rational  behavior ;  if  the 
receiver  stays  with  his  card  and  the  dealer  holds  an  ace,  then  the  dealer 
stays;  if  the  receiver  stays  and  the  dealer  holds  a  two,  then  the  dealer 
changes  with  the  deck;  etc.  The  dealer  now  has  213  strategies.  Again,  most 
of  these  strategies  will  turn  out  to  be  poor  strategies  as  will  be  described 
in  a  later  section. 

Example  4.  Colonel  Blotto  Game. 

Colonel  Blotto  and  his  enemy  each  try  to  occupy  two  posts  by  properly 
distributing  their  forces.  Let  us  assume  that  Colonel  Blotto  has  4  regi¬ 
ments  and  the  enemy  has  3  regiments  which  are  to  be  divided  between 
the  two  posts.  Define  the  payoff  to  Colonel  Blotto  at  each  post  as  follows: 
If  Colonel  Blotto  has  more  regiments  than  the  enemy  at  the  post,  Colonel 
Blotto  receives  the  enemy’s  regiments  plus  one  (the  occupation  of  the  post 
is  equivalent  to  capturing  one  regiment) ;  if  the  enemy  has  more  regiments 
than  Colonel  Blotto  at  the  post,  then  Colonel  Blotto  loses  one  plus  his 
regiments  at  the  post;  if  each  side  places  the  same  number  of  regiments, 
it  is  a  draw  and  each  side  gets  zero.  The  total  payoff  is  the  sum  of  the 
payoffs  at  the  two  posts. 

Colonel  Blotto  has  5  strategies,  or  five  different  ways  of  dividing  4  regi¬ 
ments  between  the  two  posts.  The  enemy  has  4  strategies,  or  four  different 
ways  of  dividing  his  3  regiments.  There  are  therefore  twenty  ways  for  the 
two  sides  to  distribute  their  forces. 

It  is  evident  that  if  Colonel  Blotto  places  3  regiments  at  the  first  post 
‘  and  1  at  the  second,  and  if  the  enemy  places  2  regiments  at  the  first  post 
and  1  at  the  second,  then  Blotto  wins  what  amounts  to  3  regiments. 
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However,  if  Colonel  Blotto  places  2  regiments  at  each  post  and  the  enemy 
places  all  of  his  3  regiments  at  either  post,  then  Colonel  Blotto  loses  2 
regiments.  The  following  payoff  matrix  summarizes  the  payment  to 
Colonel  Blotto  for  each  of  the  twenty  possible  distributions: 


Colonel  Blotto  Payoff 


Colonel  Blotto 
strategies 


(4,0) 
(0,  4) 
(3,  1) 
(1,  3) 
(2,2) 


Enemy  strategies 
(3,0)  (0,3)  (2,1)  (1,2) 
“  4  0  2  1 

0  4  12 

1-13  0 

-110  3 

_— 2  -2  2  2 


Example  5.  A  Coin  Guessing  Game. 

Two  players  secretly  choose  to  conceal  0,  1 ,  or  2  coins.  Each,  in  agreed 
turn,  tries  to  guess  exactly  the  total  number  concealed  by  the  two  players. 
However,  the  second  guesser  must  guess  a  different  number  from  that 
called  by  the  first.  If  a  player  guesses  correctly,  he  receives  1  from  the 
other;  otherwise  0. 

A  strategy  involves  a  joint  decision  of  what  to  conceal  and  what  to 
call.  We  may  represent  a  strategy  of  Blue  by  a  vector  (x,  x  +  y)  where 
x  =  0,  1,  or  2  and  y  =  0,  1,  or  2.  The  first  component,  x,  of  the  vector 
represents  the  number  to  conceal ;  the  second  component,  x  +  y,  represents 
the  number  to  call.  Thus  Blue  has  9  strategies. 

A  strategy  for  the  player  guessing  second,  call  him  Red,  must  include 
the  information  about  Blue’s  guess,  which  will  be  0,  1,  2,  3,  or  4.  We  can 
represent  a  Red  strategy  by  a  vector, 

(H;  x0,  xh  x2,  xh  x4), 

where  H  =  0,  1,  2  is  the  number  of  coins  concealed  and  Xi  is  the  number  of 
coins  to  call  if  Blue  calls  i.  Red  has  10  strategies.  They  are  enumerated 
as  follows: 

(1)  =  (0;  10012),  (2)  =  (0;  10022),  (3)  =  (0;  10112), 

(4)  =  (0;  10122),  (5)  =  (1;  12123),  (6)  =  (1;  12323), 

(7)  =  (2;  22343),  (8)  =  (2;  22443),  (9)  =  (2;  23343), 

(10)  =  (2;  23443). 

The  payoff  matrix  is  now  readily  written.  It  is 
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Coin  Game  Payoff 


Blue 

strategies 


Red  strategies 


(1) 

(2) 

(3) 

(4) 

(5) 

(6) 

(7) 

(8) 

(9) 

(10) 

(0, 0) 

1 

1 

1 

1 

-1 

-1 

-1 

-1 

-1 

-1 

(0, 1) 

-1 

-1 

-1 

-1 

1 

1 

-1 

-1 

0 

0 

(0,  2) 

-1 

-1 

0 

0 

-1 

0 

1 

1 

1 

1 

(1,  1) 

1 

1 

1 

1 

-1 

-1 

0 

0 

-1 

-1 

a,  2) 

0 

0 

-1 

-1 

1 

1 

-1 

0 

-1 

0 

(1,  3) 

-1 

0 

-1 

0 

-1 

-1 

1 

1 

1 

1 

(2,  2) 

1 

1 

1 

1 

0 

-1 

0 

-1 

0 

-1 

(2,  3) 

0 

-1 

0 

-1 

1 

1 

-1 

-1 

-1 

-1 

(2,4) 

_-l 

-1 

-1 

-1 

-1 

-1 

1 

1 

1 

1  _ 

Example  6.  Simplified  Poker. 


Each  of  two  players  antes  one  unit.  They  obtain  a  fixed  hand  by  draw¬ 
ing  one  card  apiece  from  a  pack  of  three  cards  numbered  J,  Q,  K.  Then 
the  players  choose  alternately  to  bet  one  unit  or  to  pass  without  betting. 
Two  successive  bets  or  passes  terminate  a  play,  at  which  time  the  player 
holding  the  higher  card  wins  the  amount  previously  wagered  by  the  other 
player.  A  player  passing  after  a  bet  also  ends  a  play  and  loses  his  ante. 

There  are  six  possible  deals.  For  each  deal  the  plays  of  the  game  may 
be  diagrammed  as  shown  in  Fig.  1.  We  see  that  for  each  deal  there  are 
five  plays.  Thus,  thirty  possible  plays  exist  in  this  game. 


Figure  1 

A  strategy  for  the  player  choosing  first,  call  him  Blue,  must  tell  him 
whether  to  pass  or  bet  for  each  of  the  three  possible  cards  he  may  hold. 
A  convenient  way  to  code  Blue’s  strategy  is  by  a  vector  (aj,  csq,  as) 
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where  a,  gives  the  instructions  for  card  i  =  J,  Q,  K.  Then  a*  may  be  coded 
as  follows: 


cl%  ~~~ 


0  means  pass  in  round  1  and  pass  in  round  2 

1  means  pass  in  round  1  and  bet  in  round  2 

2  means  bet  in  round  1. 


It  is  clear  that  Blue  has  3  X  3  X  3  =  27  strategies. 

Similarly  we  can  represent  Red’s  strategies  by  a  vector  (bj,  bq,  I>k) 
where  b,  =  0,  1,  2,  3  gives  the  instructions  for  card  i. 


means  pass  if  Blue  passes,  pass  if  Blue  bets 
means  pass  if  Blue  passes,  bet  if  Blue  bets 
means  bet  if  Blue  passes,  pass  if  Blue  bets 
means  bet  if  Blue  passes,  bet  if  Blue  bets. 


Hence  Red  has  4  X  4  X  4  =  64  strategies. 


4.  RELATIONS  AMONG  EXPECTATIONS 

Suppose  Blue  has  m  strategies  and  Red  has  n  strategies;  then  the  game 
is  determined  by  the  m  X  n  matrix  A  =  (ay)  where  ay  is  Blue’s  payoff  if 
he  uses  his  fth  strategy  and  Red  uses  hisjth  strategy.  Since  we  are  assuming 
that  the  payoff  is  from  Red  to  Blue,  Red’s  payoff  is  —ay.  Blue’s  objective 
in  the  game  is  to  make  ay  as  large  as  possible,  whereas  Red  wants  to  make 
—  ay  as  large  as  possible,  or  ay  as  small  as  possible.  In  terms  of  this  payoff 
to  Blue,  we  may  refer  to  Blue  as  the  maximizing  player  and  Red  as  the 
minimizing  player. 

Now  for  any  strategy  i  which  Blue  may  choose,  he  can  be  sure  of 
getting  at  least 

min  ay, 

j  <n 

where  the  minimum  is  taken  over  all  of  Red’s  strategies.  Blue  is  at  liberty 
to  choose  i;  therefore,  he  can  make  his  choice  in  such  a  way  as  to  insure 
that  he  gets  at  least 

max  min  ay. 

i <m  j <n 

Similarly,  for  any  strategy  j  which  Red  may  choose,  he  can  be  sure 
of  getting  at  least 

min  (—ay)  =  —  max  ay. 

i  <m 

That  is,  for  any  strategy  j  which  Red  may  choose,  he  can  be  sure  that 
Blue  gets  no  more  than 

max  ay. 

i  <m 
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Since  Red  is  at  liberty  to  choose  j,  he  can  choose  it  in  such  a  way  that 
Blue  will  get  at  most 

min  max  a* y. 

j  <n  i  <m 

Therefore,  there  exists  a  way  for  Blue  to  play  so  that  Blue  gets  at  least 

max  min  a*/, 

i <m  j <n 

and  there  exists  a  way  for  Red  to  play  so  that  Blue  gets  no  more  than 

min  max  ay. 

j  <n  i  <m 

In  general,  these  two  quantities  are  different,  but  satisfy  the  relationship 
max  min  an  <  min  max  an- 

i  <m  j  <n  j  <n  i  <m 

We  prove  the  latter  inequality  as  follows:  Given  any  i,  then 


min  an  <  an 

j  <n 

for  all  j. 

Given  any  j,  then 

max  an  >  ay 

i<m 

for  all  i. 

Hence  we  have 

min  ay  <  an  <  max  ay 

j  <n  i  <m 

or 

min  ay  <  max  ay. 

j  <n  i  <m 

Since  the  right-hand  side  of  the  preceding  inequality 
i,  we  have,  by  taking  the  maximum  of  both  sides, 

is  independent  of 

max  min  ay  <  max  ay. 

i  <771  j  <n  i  <771 

Now  the  left-hand  side  of  the  preceding  inequality  is 
By  taking  the  minimum  of  both  sides,  we  have 

independent  of  j. 

(1.1) 

max  min  ay  <  min  max  ay . 

%  <m  3  <71  j  <71  i  <771 

Another  way  of  establishing  the  inequality  is  the  following:  Suppose 
max  min  =  a„ 

i  <m  j  <n 


min  max  =  aap. 

j  <n  i  <m 

Then  it  follows  that  ar8  is  the  minimum  of  the  rth  row,  or 

Clrs  ^ 

Also,  aal 3  is  the  maximum  of  the  /3th  column;  thus 

dr  ft  ^  daft* 
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Thus  we  have 

Clrs  ^  dj.fi  ^ 

and  the  inequality  is  proven. 

Examples,  (a)  Suppose  the  game  is  such  that  its  payoff  matrix  is 
given  by 


Then  max  min  a,-,-  =  1  and  min  max  a„-  =  2.  In  this  game,  Blue  can  receive 
at  least  1.  Blue  can  guarantee  this  amount  by  playing  his  third  strategy. 
The  most  Red  needs  to  pay  or  the  most  that  Blue  can  get  is  2.  Red  can 
assure  this  upper  bound  by  playing  his  fourth  strategy. 

(b)  In  Example  2,  the  game  Morra,  we  have 

max  min  a;,-  =  —  3  and  min  max  a,-,-  =  3. 

There  is  a  way  for  Blue  to  play  so  that  he  loses  no  more  than  3  and  for 
Red  to  play  so  that  Blue  wins  no  more  than  3. 


5.  SADDLE-POINTS 

If  it  happens  that  the  inequality  (1.1)  becomes  an  equality,  or  that 

(1.2)  max  min  a„  =  min  max  ay  =  v 

%  <m  j  <n  j  <n  i  <m 

then  Blue  can  choose  a  strategy  so  as  to  get  at  least  this  common  value, 
and  Red  can  keep  Blue  from  getting  more  than  v.  In  this  case,  there  are 
strategies  i*  and  j*  for  the  two  players  such  that,  for  all  i  and  j, 

(1.3)  a>ij*  Ui*j*  a i*j 

and  at*}*  =  v. 

Thus  Blue  cannot  do  better  than  to  choose  i*;  similarly,  Red  cannot  do 
better  than  to  choose  j*. 

We  refer  to  i*,  j*  as  optimal  strategies  of  Blue  and  Red,  respectively. 
The  optimal  strategies  have  the  following  properties : 

(i)  If  Blue  chooses  i*,  then  no  matter  what  strategy  Red  chooses, 
Blue  can  get  at  least  v. 

(ii)  If  Red  chooses  j*,  then  no  matter  what  strategy  Blue  chooses, 
Blue  can  get  at  most  v. 

(iii)  If  Blue  were  to  announce  in  advance  that  he  planned  to  play 
strategy  i*,  Red  could  not  thereby  take  advantage  of  this  infor¬ 
mation  and  reduce  Blue’s  payoff.  Similarly,  if  Red  were  to 
announce  j*,  Blue  could  not  increase  his  payoff. 
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If  the  condition  (1.2)  is  satisfied,  then  the  matrix  A  =  (a,,)  is  said  to 
have  a  saddle-point  at  i*,  j*  and  its  value  is  ai*j*  =  v.  We  call  v  the  value 
of  the  game.  It  represents  the  amount  that  Blue  should  pay  Red  at  the 
start  of  the  game  in  order  to  equalize  the  game  with  respect  to  winnings. 

It  is  easily  seen  that  conditions  (1.2)  and  (1.3)  are  equivalent:  thus 
condition  (1.2)  is  satisfied  if  and  only  if  there  exists  a  pair  of  strategies 
i*,  j*  satisfying  (1.3).  From  (1.3)  we  also  have  that 

max  Oij*  =  min  a,*y  =  v. 

i <m  j <n 

Hence  a  necessary  and  sufficient  condition  that  a  game  have  a  saddle-point 
is  that  there  exists  a  member  of  the  payoff  matrix  which  is  simultaneously 
the  minimum  of  its  row  and  the  maximum  of  its  column. 

A  game  may  have  several  saddle-points.  In  such  a  case  all  the  saddle- 
points  have  the  same  value.  Each  location  of  a  saddle-point  provides  an¬ 
other  solution  or  optimal  strategy. 

Example.  The  matrix 

"1  -3  -2“ 

2  5  4 

2  3  2_ 

has  a  saddle-point  in  its  lower  left-hand  corner,  since  this  payoff,  2,  is 
simultaneously  the  minimum  of  the  third  row  and  the  maximum  of  the 
first  column.  Hence  Blue  can  expect  to  get  at  least  2  in  this  game  (by 
choosing  the  third  row)  and  Red  can  keep  Blue  from  exceeding  2  (by  choos¬ 
ing  the  first  column).  These  strategies,  i*  =  3,  j*  =  1,  are  optimal  strat¬ 
egies  of  Blue  and  Red,  respectively.  The  value  of  the  game  is  2. 

Although  the  lower  right-hand  comer  element  of  this  payoff  matrix  is 
2,  which  is  the  value  of  the  game,  it  is  not  a  saddle-point.  This  element  fails 
to  be  the  maximum  payoff  in  its  column.  However,  i*  —  2,  j*  =  1  locates 
another  saddle-point. 

The  pair  of  optimal  strategies,  i*,  j*,  is  also  said  to  be  a  solution  of  the 
game  having  a  value  v  =  a&j*. 

Example  7.  Armaments. 

Suppose  that  fighter  planes  can  be  equipped  with  any  of  these  arma¬ 
ments — guns,  rockets,  and  toss-bombs — or  for  ramming.  These  fighters  are 
to  be  used  against  bombers  of  three  types:  full  firepower  and  low  speed, 
partial  firepower  and  medium  speed,  and  no  firepower  and  high  speed. 
We  wish  to  determine  the  best  type  of  armament  for  the  fighter  and  the 
best  type  of  bomber.  Assume  that  we  are  able  to  establish  the  effectiveness 
of  each  type  of  fighter  versus  each  type  of  bomber.  This  represents  the 
payoff  and  is  measured  by  a  bomber  attrition  factor  per  fighter. 

Let  Blue  be  the  Fighter  Command  which  has  four  strategies  and  let 
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Red  be  the  Bomber  Command  which  has  three  strategies.  A  strategy  is  to 
be  picked  by  each  player  independently  of  the  choice  of  the  other.  Let  the 
payoff  to  Blue  be  the  probability  that  the  fighter  destroys  the  bomber. 
Suppose,  for  example,  the  payoff  matrix  is  as  follows: 

Armaments  Payoff 


Bomber  Command  (Red) 


1. 

2. 

3. 

Full  fire, 

Partial  fire, 

No  fire, 

low  speed 

medium  speed 

high  speed 

1. 

Guns 

"  0.30 

0.25 

0.15 

Fighter 

Command 

2. 

Rockets 

0.18 

0.14 

0.16 

0.17 

(Blue) 

3. 

Toss-bombs 

0.35 

0.22 

4. 

Ramming 

0.21 

0.16 

0.10 

The  least  that  the  Fighter  Command  can  get  is  0.15,  0.14,  0.17,  or  0.10, 
depending  upon  whether  he  uses  guns,  rockets,  toss-bombs,  or  ramming. 
These  are  the  minima  of  the  rows.  Therefore,  Blue  will  pick  toss-bombs, 
since  they  yield  the  most,  0.17.  Now  the  most  the  Bomber  Command  will 
have  to  pay  is  0.35,  0.25,  or  0.17,  depending  on  which  firepower  he  adopts. 
These  numbers  are  the  maxima  of  the  columns.  Therefore,  by  picking  no 
firepower,  he  pays  the  least,  0.17. 

The  best  strategy  for  Blue  is  his  third  strategy,  toss-bombs.  The  best 
strategy  for  Red  is  also  the  third,  no  firepower.  The  value  of  the  game  to 
the  fighter  is  0.17.  If  Blue  uses  toss-bombs,  he  is  sure  of  getting  at  least 
0.17.  If  Red  uses  no  firepower,  he  is  sure  of  losing  no  more  than  0.17.  This 
is  the  best  that  both  players  can  do. 

6.  GAMES  WITH  PERFECT  INFORMATION 

We  have  seen  that  if  a  game  has  a  saddle-point,  it  is  formally  easy  to 
find  optimal  strategies  of  this  game.  We  look  for  an  element  in  the  payoff 
matrix  which  is  simultaneously  the  minimum  of  its  row  and  the  maximum 
of  its  column.  The  location  of  this  element  yields  optimal  strategies  of  the 
two  players.  Having  found  a  saddle-point,  we  have  determined  a  solution 
of  the  game.  Of  course,  the  size  of  the  payoff  matrix  determines  the  relative 
ease  of  finding  the  saddle-point. 

There  exists  a  large  class  of  games,  referred  to  as  games  with  perfect 
information,  that  have  saddle-points.  In  a  game  with  perfect  information, 
the  players  move  alternately  and  at  each  move  a  player  is  completely  in¬ 
formed  about  the  previous  moves,  chance  or  personal,  in  the  game.  Ex¬ 
amples  of  games  with  perfect  information  are  chess  and  backgammon. 
Bridge  and  poker  are  not  games  of  perfect  information  since  the  player 
does  not  know  the  cards  dealt  to  the  other  players. 
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Thus,  if  the  strategies  for  chess  were  enumerated,  we  could  find  optimal 
strategies  by  locating  the  saddle-points  in  a  payoff  matrix  whose  entries 
consist  of  +1,  0,  —1.  The  existence  of  a  saddle-point  in  the  game  of  chess 
follows  from  the  fact  that  it  is  a  game  with  perfect  information.  However, 
because  of  the  large  number  of  strategies  and  the  size  of  the  payoff  matrix, 
the  value  of  the  game  (which  will  be  +1,  0,  or  —1)  and  the  optimal  strate¬ 
gies  for  chess  have  not  been  computed. 


THE  FUNDAMENTAL  THEOREM 


1.  PRELIMINARIES 

We  have  seen  that  if  the  payoff  matrix  A  =  (atJj  is  such  that 

(2.1)  max  min  a#  =  min  max  a,y, 

i<m  j<n  j<n  i<m 

then  Blue  has  a  strategy  t*  and  Red  has  a  strategy  j*  such  that 

aw  >  cu*j*  for  all  j, 

(2.2)  .... 

cm*  <  cij*y*  tor  ail  i. 

We  called  i*,  j*  optimal  strategies  of  Blue  and  Red,  respectively,  and  the 
pair  i*,  j*  is  said  to  be  a  solution  of  the  game.  In  this  case  the  solution  can 
be  obtained  by  finding  the  element  of  the  matrix  («,,)  which  is  simultane¬ 
ously  the  minimum  of  its  row  and  the  maximum  of  its  column.  Further, 
even  if  a  player  announces,  prior  to  the  play  of  a  game,  the  optimal  strategy 
he  plans  to  use,  his  opponent  cannot  take  advantage  of  this  information. 
In  other  words,  if  (2.1)  holds  there  is  no  reason  for  secrecy  with  respect  to 
strategy  choices  on  the  part  of  either  player. 

Example.  In  the  game  whose  payoff  matrix  is 

”1  4  r 

2  3  4  , 

_0  -2  7_ 

there  is  a  saddle-point  at  i  =  2,  j  =  1  and  its  value  is  2.  If  Blue  an¬ 
nounces  he  plans  to  play  strategy  i  =  2,  then  Red  cannot  reduce  Blue’s 
winnings  below  2,  since  Red  will  then  do  best  by  choosing  j  =  1.  Sim¬ 
ilarly,  if  Red  announces  he  plans  to  play  his  first  strategy,  Blue  cannot 
increase  his  winnings  above  2,  since  Blue  will  then  choose  i  =  2. 

16 
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2.  GAMES  WITHOUT  SADDLE-POINTS 

Let  us  now  consider  games  whose  payoff  matrices  are  such  that 
(2.3)  max  min  ay  <  min  max  ay. 

i<m  j  <n  j  <n  i<m 

The  left-hand  side  of  the  inequality  represents  the  least  Blue  can  get  and 
the  right-hand  side  represents  the  most  Blue  can  get.  In  other  words, 
Blue  has  a  strategy  which  assures  him  of  getting  at  least 

max  min  a y 

i  <m  j  <n 

and  Red  has  a  strategy  which  assures  him  that  Blue  cannot  get  more  than 

min  max  ay. 

j  <n  %<m 

Since  these  two  quantities  are  unequal,  we  have  not  arrived  at  a  solution  to 
the  game. 

Example.  The  game  defined  by  the  payoff  matrix 

G  I] 

does  not  have  a  saddle-point.  For  here  we  have 

min  max  an  =  4, 

3 <2  i  <2 

max  min  ay  =  3, 

t  <2  j  <2 

and  max  min  ay  <  min  max  ay. 

Blue  can  guarantee  himself  a  payoff  of  3  units.  However,  Red  can  guar¬ 
antee  that  it  won’t  cost  him  more  than  4  units.  Therefore,  Blue  will  try  to 
get  more  than  3  units  and  Red  will  try  to  cut  his  cost  below  4  units,  or 
Blue  should  win  between  3  and  4  per  play. 

Let  us  examine  Blue’s  reasoning  in  this  situation.  Suppose  Red  could 
discover  Blue’s  optimal  strategy,  perhaps  by  computing  it.  Then  if  this 
strategy  is  Blue’s  first  strategy,  Red  can  drive  Blue’s  winnings  down  to 
1 ;  if  the  strategy  is  Blue’s  second  strategy,  his  winnings  will  be  3.  There¬ 
fore,  if  Blue’s  strategy  were  discovered,  his  winnings  would  be  either  1 
or  3  on  each  play.  However,  Blue  is  trying  to  get  between  3  and  4.  Thus, 
Blue  is  at  a  disadvantage  if  Red  knows  Blue’s  strategy. 

It  seems  then  that  Blue  should  concentrate  on  trying  to  prevent  Red 
from  discovering  the  strategy.  One  way  would  be  for  Blue  to  choose  his 
strategy  at  random.  Red  then  could  not  discover  Blue’s  strategy  because 
Blue  would  not  know  the  strategy  in  advance  of  the  play.  On  the  other 
hand,  if  Blue  knew  Red’s  strategy  then  Blue  could  get  either  5  or  4, 
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whereas  Red  is  trying  for  between  3  and  4  per  play.  Hence  Red  should  also 
choose  his  strategy  at  random. 

3.  MIXED  STRATEGIES 

In  a  game  without  a  saddle-point,  we  saw  that  a  player’s  choice  of  a 
strategy  will  depend  on  his  opponent’s  choice.  Therefore,  it  is  very  im¬ 
portant  that  a  player  find  out  his  opponent’s  choice  of  strategy— hence,  it 
is  essential  that  a  player’s  choice  of  a  strategy  be  unknown  to  his  opponent. 
Each  player  will  concentrate  on  keeping  his  own  intentions  secret.  One 
way  to  do  this  is  by  using  a  random  device  for  selecting  a  strategy. 

We  shall  now  extend  the  notion  of  a  strategy  which  will  satisfy  the 
preceding  discussion.  A  player,  instead  of  choosing  a  single  strategy,  may 
leave  the  choice  of  the  strategy  to  chance.  That  is,  he  may  choose  a  prob¬ 
ability  distribution  over  his  set  of  strategies  and  then  the  associated  random 
device  selects  the  particular  strategy  for  the  play  of  the  game.  Such  a 
probability  distribution  over  the  whole  set  of  strategies  of  a  player  is  a 
mixed  strategy. 

The  function  of  a  mixed  strategy  is  to  keep  the  opponent  from  dis¬ 
covering  the  strategy.  If  a  game  has  a  saddle-point,  then  there  is  no  dis¬ 
advantage  to  a  player  if  his  chosen  strategy  is  found  out  by  his  opponent. 
However,  if  the  game  does  not  have  a  saddle-point,  it  is  a  definite  disadvan¬ 
tage  for  a  player  to  have  his  strategy  discovered  by  his  opponent.  A  mixed 
strategy  provides  a  method  for  a  player  to  protect  himself  against  having 
his  particular  strategy  found  out  by  his  opponent;  choosing  from  several 
different  strategies  at  random,  with  only  their  probabilities  determined,  is 
an  effective  protection  against  being  found  out.  The  opponent  cannot 
learn  the  particular  strategy  in  advance,  since  the  player  does  not  know 
it  himself.  The  strategy  is  selected  at  the  last  moment  with  the  help  of  a 
random  device. 

The  game  now  requires  each  player  to  select  independently  a  mixed 
strategy.  The  outcome  will  now  be  measured  in  terms  of  expectation. 

We  shall  represent  mixed  strategies  by  column  matrices.  Let  be  the 
probability  of  selecting  strategy  i.  Then  a  mixed  strategy,  or  probability 
distribution  X,  for  Blue  may  be  represented  by  a  column  matrix 


where 


Xi  >  0, 


*  =  1,  2,  ... , 


m 
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m 

and  2  Xi  =  1. 

Similarly,  if  y,  is  the  probability  of  selecting  strategy  j,  then  a  mixed 
strategy  for  Red  is  a  column  matrix 

~V\ 

Vi 

:  - 
J/»_ 

where  y,  >  0,  j  —  1,2, ...  ,n 

n 

and  2  y,  —  1. 

i= i 

If  it  happens  that  x{  =  1  for  some  i,  then  x  is  called  a  pure  strategy. 

Having  defined  mixed  strategies  as  probability  distributions,  we  need 
to  compute  expected  payoffs.  Suppose  Blue  chooses  strategy  i  and  Red 
chooses  mixed  strategy  Y ;  the  expected  payoff  to  Blue  is 

n 

hi  =  2  ai3yj, 

3  =  1 


which  is  given  by  the  ith  component  of  the  column  matrix 


H  =  AY 


UmJ 


If  Red  uses  strategy  j  and  Blue  uses  mixed  strategy  X,  the  expected  payoff 
to  Blue  is 

m 

hj  2  UijXi, 

i=i 


which  is  the  jth  component  of  the  row  matrix  K' ,  where 
K'  =  X'A  =  (7d,  fa, ... ,  fa). 

If  Blue  and  Red  use  mixed  strategies,  X,  Y,  respectively,  then  the  expected 
payoff  to  Blue  is 

«  m 

E  =  X'AY  =22  aaxiiyi  -  K'Y  =  X'H. 

J—l 1=1 

Example.  In  the  Colonel  Blotto  payoff  matrix  (Example  4),  if  the 
enemy  uses  the  mixed  strategy 
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rn 


LiJ 


and  Colonel  Blotto  uses  a  pure  strategy,  Blotto’s  expectation  for  each  of 
his  pure  strategies  is  H  =  AY,  or 


"  4 

0  2  1" 

rn 

<N 

1 _ 

0 

4  1  2 

4 

0 

1 

2 

1 

l 

H  = 

1 

-1 

-13  0 

1  0  3 

- 

if 

1 

2 

—  2 

-2  2  2_ 

L.4  J 

_1  _ 

where  the  components  of  H  represent  Colonel  Blotto’s  receipts  corres¬ 
ponding  to  each  one  of  his  five  pure  strategies.  Now,  if  the  two  players’ 
strategies  are 


rjr 

4 

0 

0 

I 

4 

1 


Y  = 


4 

0 

1 

2 

l£j 


then  Blotto’s  expectation  is  E  =  X'AY  =  X'H,  or 

-2f 


[b  0,  0,  i  §] 


1 

1 3 

•*4, 

1 
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4.  GRAPHICAL  REPRESENTATION  OF  MIXED  STRATEGIES 

It  is  possible  to  represent  graphically  the  expectation  of  a  player  as  a 
function  of  his  mixed  strategies.  If  a  player  has  two  strategies,  the  graphical 
representation  is  two-dimensional;  for  three  strategies,  the  representation 
requires  three  dimensions.  For  example,  suppose  we  have  the  following 
payoff  matrix: 

Red  strategies 

0  0 

Blue  ©  ["  5  1  “I 

strategies  ©  [_  3  4  J 

We  can  represent  the  various  mixtures  of  Blue  strategies  ®  and  ©  by 
points  on  the  line  S\Si  of  unit  length  (Fig.  2).  Thus  Q  represents  f  of  ® 
and  j  of  ©.  If  Red  uses  his  strategy  0,  then  Blue’s  expectations  are  given 
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by  ordinates  PQ  of  the  points  P  of  AB  corresponding  to  each  of  Blue’s 
mixed  strategies.  Thus  PQ  —  4|  is  the  expectation  of  Blue  if  he  uses  a 
mixed  strategy  of  f  of  ®  and  \  of  ©  and  Red  uses  strategy  0. 


Figure  3 


If  Blue  uses  strategy  ©,  then  the  expectation  of  Blue  for  each  of  Red’s 
mixtures  of  0  and  0  is  given  (Fig.  3)  by  the  ordinates  of  CD.  Thus 
LM  —  3f  is  the  expectation  of  Blue  if  he  uses  strategy  ©  and  Red  uses 
a  mixture  M  consisting  of  |  of  0  and  -f  of  0. 


5.  THE  MINIMAX  THEOREM 

Having  introduced  mixed  strategies,  we  now  interpret  a  game  with 
payoff  matrix  A  =  (a#)  as  consisting  of  Blue’s  choosing  a  mixed  strategy 
X  and  Red’s  choosing  a  mixed  strategy  Y.  The  payoff  to  Blue  is  the 
expected  payoff  given  by 

n  m 

(2.4)  E  —  X' AY  =  S  2  aijXiyj. 

y = i  i  =  i 

Although  a  player  chooses  the  mixed  strategy,  the  particular  strategy 
which  eventually  plays  the  game  is  chosen  by  chance,  subject  to  the  given 
probability  distribution. 

Suppose  Blue  chooses  his  strategy  by  using  a  mixed  strategy  X.  Then 
he  can  expect  to  receive  at  least 

min  X' A  Y, 

Y 

where  the  minimum  is  taken  over  all  possible  mixed  strategies  available  to 
Red.  Now  since  Blue  has  the  choice  of  X,  he  will  select  X  so  that  this 
minimum  is  as  large  as  possible.  Hence  Blue  can  pick  a  mixed  strategy, 
call  it  X*,  which  will  guarantee  him  an  expectation  of  at  least 

max  min  X' A  Y, 
x  Y 

irrespective  of  what  Red  does.  Similarly,  for  each  mixed  strategy,  F, 
chosen  by  Red,  the  most  he  will  have  to  pay  to  Blue  is 

max  X'AY, 
x 
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where  the  maximum  is  taken  over  all  mixed  strategies  available  to  Blue. 
Now  Red  can  choose  F  so  that  the  latter  quantity  is  as  small  as  possible. 
Hence,  Red  can  pick  a  mixed  strategy,  Y*,  which  will  make  the  expectation 
of  Blue  at  most 

min  max  X'A  Y, 
y  x 

irrespective  of  what  Blue  does.  It  is  apparent  from  the  above  remarks, 
and  it  can  be  easily  shown,  that 

(2.5)  max  min  X'A  Y  <  min  max  X'A  Y. 

x  y  y  x 

The  minimax  theorem  states  that  these  two  quantities  always  have  a 
common  value,  v,  or  that 

(2.6)  max  min  X'A  Y  —  min  max  X'A  Y  —  v. 

x  y  y  x 

This  remarkable  result  is  the  fundamental  theorem  of  game  theory.  We 
shall  prove  this  theorem  in  section  8. 

6.  OPTIMAL  MIXED  STRATEGIES 

From  (2.6)  it  follows  that  Blue  has  a  mixed  strategy  X*  and  Red  has 
a  mixed  strategy  F*  such  that 

X*'AY  >  v  for  all  F, 

(2.7)  X'AY*<v  for  all  X, 

X*'AY*  =  v. 

The  pair  X*,  Y*  is  called  a  solution  of  the  game,  and  v,  the  value  of  the 
game.  We  also  refer  to  X*,  Y*  as  optimal  strategies  since 

(i)  If  Blue  uses  X*,  his  expectation  is  at  least  v,  irrespective  of  what 
Red  does. 

(ii)  If  Red  uses  F*,  he  can  make  Blue’s  expectation  at  most  v,  ir¬ 
respective  of  what  Blue  does. 

Since  X*’AY  >  v  for  all  F  and  X’A  F*<  v  for  all  X,  the  solution  also 
has  the  property  that  a  player  may  announce,  in  advance  of  the  play,  the 
mixed  strategy  to  be  employed,  and  the  opponent  will  be  unable  to  reduce 
the  expectation  of  the  player  as  a  result  of  this  extra  knowledge.  Of  course, 
this  announcement  gives  information  only  about  the  mixed  strategy  to  be 
employed.  It  does  not  give  any  information  about  the  pure  strategy  which 
will  actually  be  used  in  playing  the  game.  Neither  player  knows  the  partic¬ 
ular  pure  strategy,  since  that  is  left  to  a  chance  device. 

Thus,  if  Blue  uses  an  optimal  mixed  strategy  X *  and  Red  knows  this 
in  advance,  Red  need  not  shift  from  an  optimal  strategy  F*  as  a  result  of 
the  information.  A  similar  argument  applies  to  Blue. 
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Example.  In  Example  4,  the  Colonel  Blotto  game,  it  can  be  verified  by 
(2.7)  that  an  optimal  mixed  strategy  X*  of  Colonel  Blotto  and  Y*  of  the 
enemy  is  given  by 


[-A- 

9 

4 

9 

1  "1 

TS 

1 

0 

0 

II 

* 

Tg- 

4 

9 

4 

1 

l_9'j 

L  ¥  J 

Colonel  Blotto  should  pick  strategies  1,  2,  5  at  random  with  prob¬ 
abilities  $,  f,  respectively.  The  enemy  should  pick  strategies  1,  2,  3,  4 
at  random  with  probabilities  psi  tVi  f>  it,  respectively.  The  value  of  the 
game  is  A,4  to  Blotto;  i.e.,  Colonel  Blotto,  by  playing  optimally,  can  expect 
to  receive  at  least  regiments  from  the  enemy.  If  Colonel  Blotto  uses 
his  optimal  mixed  strategy,  then,  no  matter  what  the  enemy  does,  Colonel 
Blotto  receives  at  least  -©  regiments.  If  the  enemy  uses  his  optimal  mixed 
strategy,  then  Colonel  Blotto  may  play  any  strategy  except  his  third  and 
fourth  and  still  receive  regiments. 


7.  GRAPHICAL  REPRESENTATION  OF  MINI  MAX  THEOREM 

If  one  player  has  two  strategies  and  the  other  has  any  number  of  strat¬ 
egies,  it  is  possible  to  illustrate  graphically,  in  two  dimensions,  the  minimax 
theorem  and  also  solve  the  game.  For  example,  if  the  payoff  matrix  is 

Red 

strategies 

0  0 

Blue  ©  T  5  1  I 

strategies  ©  L  3  4  J 

the  graph  shown  in  Fig.  4  summarizes  the  expectation  of  Blue  for  various 
mixed  strategies  of  Blue  and  Red.  The  broken  line  AB  represents  Blue’s 
expectation  for  each  of  his  mixed  strategies  if  Red  uses  pure  strategy  0. 


A 
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The  broken  line  CD  represents  Blue’s  expectation  for  each  of  his  mixed 
strategies  if  Red  uses  strategy  0.  The  line  A'B'  represents  the  payments 
by  Red  for  each  of  his  mixed  strategies,  if  Blue  uses  pure  strategy  ® .  1  he 
line  C'D'  represents  the  payments  by  Red  if  Blue  uses  pure  strategy  ©. 

The  solid  lines  COB,  therefore,  represent  the  least  that  Blue  can  receive 
for  each  of  Red’s  mixed  strategies.  It  is  apparent  that  Blue  will  pick  the 
maximum  point  of  COB  and  Red  will  pick  the  minimum  point  of  A  0  D  . 
From  the  minimax  theorem  these  two  points  are  equal,  and  the  value  of 
the  game  is  OP  =  O'P'  =  3.4.  The  points  P  and  P'  give  the  solution  of 
the  game  and  correspond  to 


8.  PROOF  OF  THE  MINIMAX  THEOREM 

In  this  section  we  shall  give  a  proof  of  the  minimax  theorem,  the  funda¬ 
mental  theorem  of  games  of  strategy.  We  shall  prove  that  for  any  matrix 
A  —  (flu),  we  have  that 

n  m  n  m 

(2.8)  max  min  2  2  =  min  max  2  2  ay*<yy  =  v, 

X  Y  j=li=l  Y  X  j=\i=l 

where  X  and  Y  are  the  sets  of  probability  distributions  over  Blue’s  and 
Red’s  strategies,  respectively.  From  this  theorem  it  follows  that  every 
finite  zero-sum,  two-person  game  has  optimal  mixed  strategies. 

There  exist  many  proofs  of  this  fundamental  theorem.  Some  of  the 
proofs  use  fixed-point  theorems;  others  are  based  on  a  separation  theorem 
of  convex  sets.  Almost  all  these  proofs  are  existence  proofs — they  prove 
the  existence  of  optimal  strategies  but  are  not  constructive  from  the 
viewpoint  of  practical  computation. 

We  shall  give  an  elementary  proof  of  the  minimax  theorem  which  is 
purely  algebraic,  requiring  nothing  more  advanced  than  the  notion  of  an 
inverse  of  a  matrix.  At  the  same  time,  the  proof  will  provide  an  efficient 
method  for  computing  optimal  mixed  strategies  for  both  players. 

First,  let  us  derive  a  relationship  that  exists  between  the  maximum 
over  mixed  strategies  and  the  maximum  over  pure  strategies.  Suppose 
<f>i,  <j>t ,  ,<t>m  are  m  arbitrary  numbers,  and  let  X  =  (xl;  x2,  ...  ,  xm)  be 

a  mixed  strategy  of  m  pure  strategies.  Then  for  all  i 

<j>i  <  max  <t>i  =  4>k, 

i 

where  the  max  is  taken  over  the  m  strategies.  Multiplying  by  Xi  and 
summing,  we  get 

m 

S  <  max  =  <f)k 
1  i 
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or  that 

to 

max  2  4>iXi  <  max  <j>i  =  <f>k- 
x  »  =  1 

Using  the  particular  mixed  strategy  which  assigns  a  probability  of  1  to 
the  fcth  strategy,  we  have 

max  2  4>iXi  >  •  0  +  fa  •  0  +  .  . .  +  •  1  +  .  •  •  +  <£n  ■  0  =  fa. 

X  »  =  1 

Hence  it  follows  that 

m 

(2.9)  max  2  <t>iXi  =  max  <tn. 

X  i  =  1  < 

Similarly,  we  have 

n 

(2.10)  min  2  =  min  4>j. 

Y  j  =  1  j 

Equivalent  'problem.  Let  Z(X)  and  m(U)  be  defined  as  follows: 

n  m  to 

l(X)  =  min  2  2  aijX<yj  =  min  2  aijXi, 

,  Y  >« 1»=1  i  *=i 

n  m  n 

u(Y)  —  max  2  2  a^x^jj  =  max  2  a^j. 

X  j=l i=l  i  y=i 

Then,  from  the  definition  of  a  maximum  and  minimum,  we  have  the 
following  two  inequalities: 

TO 

2  a.jT;  >  l(X)  for  all  j, 

»=i 

n 

S  <  u(Y)  for  all  i. 

j=i 

Now  multiplying  the  first  inequality  by  yj  and  summing  over  j,  and  multi¬ 
plying  the  second  inequality  by  Xi  and  summing  over  i,  we  get 

l(X)  <  u(Y). 

In  terms  of  these  variables,  the  minimax  theorem  states  that 

max  l(X)  —  min  u{Y). 
x  Y 

Now  suppose  we  can  exhibit  a  pair  of  mixed  strategies,  X *  and  Y*, 
with  the  property  that 

l(X*)  =  u(Y*). 

Then  since 

UX*)  <  max  l(X)  <  u(Y*), 
x 

l(X*)  —  max  l(X). 
x 


it  follows  that 
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Similarly, 

u(Y*)  =  min  u(Y). 

Therefore  the  minimax  theorem  is  essentially  a  statement  of  the  fol¬ 
lowing  equivalent  problem:  Find  an  m-component  vector  X*  and  an 
n-component  vector  Y*  whose  components  Xi  and  y3,  respectively,  satisfy 


the  following 

seven  conditions:  * 

I. 

Xi  >  0, 

i  =  1, 

2,.. 

• ,  to; 

II. 

m 

S  Xi  ~  1  j 
t*=  i 

III. 

l(X*)  <  2  anXi, 

*= i 

j  =  1, 

2,.. 

.  ,n; 

(2.11)  IV. 

Vi  >  o, 

j  =  1, 

2,.. 

.,n; 

V. 

S  Vi  =  l; 

3  =  1 

VI. 

If  any i  <  u(Y*), 

3  =  1 

*=  1, 

2,.  . 

• ,  to; 

VII. 

l(X*)  =  u(Y*). 

Our  proof  consists  of  constructing  an  X*  and  Y*  satisfying  the  seven 
relations  above. 

Basis.  We  begin  the  proof  by  defining  the  augmented  matrix  G  of  the 
game  matrix  A  =  (a,-,)  as  follows: 


(2.12) 


io 

B  i 

b2 

.  • 

Bn+i 

r  n+2 

•  •  •  n+w 

“  0 

1 

1 

.  .  1 

0 

0 

0  “ 

-1 

On 

a\2 

•  •  CL\ n 

1 

0 

0 

-1 

an 

0.22 

•  .  0-2n 

0 

1 

... 

_-l 

ami 

0  m  2 

•  •  Clmn 

0 

0 

l 

This  matrix  has  m  +  1  rows  and  n  +  to  +  1  columns.  Let  us  label  the 
initial  row  of  G  as  i  =  0.  Let  us  label  the  columns  from  left  to  right  as 
follows: 


Po‘,  Pi,  •  •  •  ,  P n',  P n+1  —  Ul,  U2,  •  •  •  ,  Um  —  Pn+m, 


where  Ui  are  vectors  with  1  as  the  ith  component  and  zeros  elsewhere. 
Further,  let  us  assume  that  we  have  originally  arranged  the  rows  of  A 
such  that 


(2.13) 


am i  =  max  an. 

i<m 
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Suppose  we  choose  m  +  1  columns  of  G.  They  determine  an  m  +  1 
square  matrix  B.  We  shall  call  B  a  basis  if  B  satisfies  the  following 
conditions: 


B\.  P0  is  included  as  the  initial  column  of  B ; 

B-2.  B  is  nonsingular — i.e.,  B  has  an  inverse,  B~l; 

Bz.  Each  row  of  B~l,  except  possibly  the  initial  row  which  we  label  as 
i  =  0,  has  its  first  nonzero  component  positive. 


An  example  of  a  basis  is  the  following  matrix: 


(2.14)  B°  =  (P„,  I\,  tV. ,  Um- 1)  = 


0 

1 

0 

0 

...  0" 

-1 

an 

1 

0 

...  0 

-1 

U21 

0 

1 

...  0 

• 

0 

... 

• 

• 

...  1 

_-l 

0  ml 

1  Ti 

0 

0 

...  0_ 

(2.15) 


m-1  = 


0  ... 

0 

-11 

1 

0  ... 

0 

0 

~R°o~ 

R°i 

bi 

1  ... 

0 

-1 

b2 

0  ... 

0 

-1 

= 

■ 

•  .  . . 

0 

• 

„ R°m _ 

_bm~l 

0  ... 

1 

-1_ 

where  bt  =  am]  —  a»i.  We  can  also  express  the  inverse  matrix  in  terms  of 
the  row  vectors  R°t.  From  (2.13)  it  follows  that  bt  >  0;  hence  every  row  of 
(P0)-1,  with  the  possible  exception  of  i  =  0,  has  its  first  nonzero  component 
positive. 


Ordering  of  vectors.  Condition  B3  in  the  definition  of  a  basis  can  be  ex¬ 
pressed  as  a  comparison  of  vectors  with  the  null  vector.  Later  in  the  proof 
we  shall  find  it  convenient  to  compare  two  arbitrary  vectors — i.e.,  given 
two  vectors  which  are  different,  we  wish  a  rule  for  selecting,  say,  the  larger 
vector.  We  do  this  by  defining  a  suitable  ordering  relationship  among 
vectors. 

First,  let  us  define  equality  of  vectors.  Two  vectors  A  =  (af)  and 
B  =  (6,)  are  equal  if  and  only  if  a,-  —  bi  —  0  for  every  i.  That  is,  A  is  equal 
to  B  (written  A  —  B)  if  and  only  if  the  vector  (A  —  B)  is  the  null  vector, 
0.  Now  suppose  the  two  vectors  A  and  B  are  not  equal,  then  the  vector 
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(. A  —  B)  contains  at  least  one  nonzero  component.  If  the  first  nonzero 
component  of  ( A  —  B)  is  positive,  then  we  shall  say  A  is  greater  than  B, 
and  write  A  >  B.  If  the  first  nonzero  component  of  (. A  —  B)  is  negative, 
then  A  is  smaller  than  B  and  we  write  A  <  B. 

If  the  rows  of  the  inverse  of  a  basis  B  are  denoted  by  Ro,  Ri,  ,  Rm, 
then  the  condition  B3  defining  a  basis  states 

(2.16)  Ri  >0,  i  =  1, 2, .  .  . ,  m. 

From  the  above  method  of  ordering  it  follows  that  the  minimum  vector 
of  a  series  of  vectors  is  that  vector  whose  first  component  is  least;  if  there 
is  a  tie,  then  one  compares  the  second  components  of  the  tying  vectors,  etc. 

Optimal  basis.  Let 

B  —  ( Po ,  Pjl,  Pjl,  •  •  •  )  Pjm) 
be  a  basis  whose  inverse  is 


where  the  R’s  are  row  vectors  and  the  C’s  are  column  vectors.  Then  from 
the  fact  that 

B~lB  =  7, 

where  7  is  the  unit  matrix,  it  follows  that 

RkPj,  =  0  for  i  5*  k, 

RiPji  =  1,  i  =  k  =  0,  1,  2, .  .  . ,  m, 

where  jo  =  0. 

In  particular,  we  have 

(2.17)  RoPji  =  0  for  ji  -  ji,  j2,  .  .  . ,  jm. 

Now  suppose  we  form  the  n  +  m  scalar  products 

RoPj,  j  =  1,  2, .  .  . ,  n  +  m. 

From  (2.17)  it  follows  that  at  least  m  of  these  products  will  be  zero.  If  the 
remaining  n  scalar  products  are  no  larger  than  zero,  then  we  shall  call  B 
an  optimal  basis.  As  will  be  shown,  an  optimal  basis  yields  optimal  mixed 
strategies  of  the  players.  In  other  words,  if  the  basis  B  is  such  that 

RoPj  <0,  j  -  1,  2, .  .  . ,  n  +  m, 

then  B  is  an  optimal  basis. 

Optimal  strategies.  Let  B  be  an  optimal  basis.  Let  the  components  of 
the  0-row  and  0-column  of  B~l  be  denoted  as  follows: 
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(2.18)  R0  (If  %1}  %2f  •  •  ■  j  %m) } 

(2-19)  Co  =  (u,  yh,  yh, .  .  .  ,  y ,J. 

Wc  shall  show  that  an  optimal  mixed  strategy  X*  for  Blue  is  obtained 
from  the  row  Ro  by  setting 

X*  =  (xh  X2,  ,  xm). 

An  optimal  mixed  strategy  F*  =  (yh  y2,  ... ,  yn)  for  Red  is  obtained  from 
the  column  C0  by  setting 

V,  =  Vu  for  ji  <  n, 

=  0  for  all  other  j. 

We  shall  prove  that  X*  and  Y*  are  optimal  mixed  strategies  of  the 
original  game  by  showing  that  X*  and  Y*  as  defined  above  satisfy  the 
seven  conditions  I-VII  of  (2.11). 

Suppose  we  have  an  optimal  basis;  let  us  form  the  products  R0Pj  for 
different  values  of  j. 

If  j  =  0,  we  have 

m 

1  =  RJ\  =  2  *if 

t=i 

hence  condition  II  is  satisfied. 

If  1  <  j  <  n,  we  have 

m 

0  ^  R0P j  =  1  2  d  ijX if 

i  —  1 

which  satisfies  condition  III. 

If  n  +  1  <  j  <  n  +  m,  we  have 

0  >  RoPj  =  RoUi  =  —Xi  for  1  <  i  <  m, 

hence  condition  I  is  satisfied. 

Now  since  B  is  a  basis,  we  have 

Ri>  0,  i  =  1,  2,  .  .  . ,  m. 

In  particular  the  first  component  of  each  /?,■  is  non-negative.  But  from 
(2.17)  the  first  component  of  each  Ri  is  yj{.  Hence  IV  is  satisfied. 

Since  BB~l  —  I,  we  have  in  particular  that 

BC0  =  U0. 

From  this  matrix  equation  we  get  m  +  1  linear  equations  in  the  m  +  1 
variables  ( u ,  y y„  yn, .  .  . ,  yjm).  The  first  of  these  equations  is 

n 

S  Vi.  =  2  Vi  =  1, 

ji  <71  j  =  1 

since  the  first  component  of  each  Pj,  is  1  for  1  <  ji  <  n  and  0  otherwise. 
Hence  V  is  satisfied.  Now  the  remaining  m  equations  may  be  written  as 
follows : 
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-U+  s  din  Vn  +  2  =  0,  i  =  1,2,  ...  ,rn, 

jt  <n  jk  >n 

where  Sjt  >  0.  It  follows  that 

n 

2  diflji  <u,  i  =  1,  2, .  .  . ,  m 

3  =  1 

and  VI  is  satisfied.  Finally  VII  is  satisfied  since  u  and  l  are  both  defined  by 
the  same  element  of  B~x. 

Constructing  an  optimal  basis.  The  problem  of  finding  optimal  strategies 
has  now  been  reduced  to  the  problem  of  constructing  an  optimal  basis. 
The  inverse  of  such  a  basis  will  yield  optimal  strategies.  We  shall  now 
describe  an  iterative  procedure  for  constructing  an  optimal  basis  and  its 
inverse. 

The  iterative  procedure  starts  with  some  basis,  for  example  B°  as  de¬ 
fined  in  (2.14).  If  B°  is  not  optimal  then  we  construct  from  B°  a  new  basis, 
B1,  which  differs  from  B°  by  only  one  column.  Further,  if  Rl  is  the  0th 
row  of  ( Bl)~x ,  the  inverse  of  B\  then  Rh  will  have  the  property  that 

(2.20)  Rl  >  Rl 

If  B 1  is  not  optimal,  then  we  iterate  the  preceding  algorithm  for  Bl,  etc. 

This  process  generates  a  sequence  of  bases.  From  (2.20)  it  follows  that 
no  basis  can  be  repeated.  Further,  the  number  of  bases  cannot  exceed  the 
number  of  ways  of  choosing  to  columns  out  of  w  +  to  columns  of  the  aug¬ 
mented  matrix  G.  However,  we  shall  terminate  the  process  when  we  have 
arrived  at  an  optimal  basis,  which  is  always  possible,  as  will  be  shown. 

Suppose  the  basis  B°  is  not  optimal,  then  there  exists  some  Pj  in  G  such 
that  RoPj  >  0.  To  construct  Bl  from  Ba  we  let  P„  replace  a  column  of 
B°,  where  P,  is  determined  by  the  condition 

(2.21)  RoPs  =  max  R?,Pj  >0,  j  =  1,  2, .  .  .  ,  n  +  to. 

3 

In  case  the  choice  of  Ps  is  not  unique,  then  choose  P,  with  the  smallest 
index. 

Next  compute  the  column  vector  V  =  (v0,  vh  .  .  .  ,  vm)  satisfying  the 
equation  B°V  =  Ps.  Hence  we  have 

(2.22)  Vi  =  RIP.,  i  =  0,  1,  .  .  . ,  to. 

In  particular,  we  have 

Vo  =  RoP s  3>  0. 

Further,  v{  >  0  for  some  other  i  ^  0.  For,  if  we  assume  the  contrary, 
namely  that  v,  <  0  (i  ^  0),  then  from  the  relationship 

m 

Ps  «  B»v  =  voPo  +  S  ViPfr 

i=  1 


it  follows  that 
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Po 


Hence  the  column  P0  can  be  written  as  a  positive  linear  combination  of  m  + 1 
columns  of  G.  From  the  definition  of  G,  it  is  clear  PQ  cannot  be  written  as 
a  positive  linear  combination  of  the  other  columns,  and  so  we  have  a 
contradiction. 

We  now  choose  to  drop  from  B°  that  column  Pj,  such  that 
(2.23)  min  —  —  —  for  Vi  >  0,  vT  >  0,  i  iA  0. 


From  the  above  it  follows  that 


(2.24) 


ffl  -  -  R°r  >  0  for  Vi  >  0. 

Vr 


We  have  constructed  B 1  from  B°  by  dropping  from  B°  column  Pj,  and 
adding  column  Ps,  as  defined  above.  We  need  to  show  that  B1  is  a  basis. 
We  shall  do  this  by  constructing  [P1]-1  from  [P0]-1. 

Let  the  rows  of  [P1]-1  be  designated  by  R}  for  *  =  0,  1,  2,  .  .  . ,  m.  We 
shall  now  verify  that  we  can  get  R }  from  R?  and  vt  by  letting 


(2.25) 


Rt  =  fi?  -  -  R°r  for  i  *  r, 

Vr 

pi  _  J:  po 

lXr  —  Jtvf, 

vT 


Using  [P1]-1  as  defined  by  (2.25)  we  shall  verify  that  [B1]~1B1  =  I.  First, 
if  i  ^  r  and  j,  5^  s, 

R\PU  =  RlPu  -  -  R°rPji  =  0  for  i  *  k, 

Vr 


R\Pn  =  RtPj„  -  -  R°rPjt  =1-0  =  1  for  i  =  k. 
vr 

Second,  if  i  ^  r  and  j,  =  s,  we  have 


RIP ,  =  RU\  -  -R°rPs  =  Vi  -  -vr  =  0. 

Vr  Vr 


Thirdly,  if  i  =  r  and  ji  ^  s,  we  have 


R\Pji  =  7  R°rPj<  =  0. 


Finally,  if  i  —  r  and  ji  =  s,  we  have 


Therefore  (2.25)  yields  the  inverse  of  the  matrix  B1. 
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In  order  to  complete  the  proof  that  B  is  a  basis,  we  need  to  show  that 
R}  >  0  for  i  =  1,2,...,  m.  Now  for  i  =  r,  we  have  R°  >  0  and  v,  >  0. 
Hence 

R\  =  -  R°r  >  0 . 

Vr 

If  i  r,  and  if  vt  <  0,  then 

R\  =  R?  ~-R°r>  0. 

Vr 

If  i  7^  r,  and  if  Vi  >  0,  then  from  (2.24)  we  have 

R\  =  R?  -  -  >  0. 

Vr 

Therefore  B1  is  a  basis. 

Finally,  we  need  to  show  that  no  basis  can  be  repeated  in  this  process. 
It  is  sufficient  that  (2.20)  is  satisfied,  or  that  Rl  <  /r'o-  This  follows  from 
the  fact  that  R°r  is  a  row  of  a  nonsingular  matrix  and  hence  must  possess 
at  least  one  nonzero  component,  the  first  of  which  must  be  positive.  Since 
Vo  >  0  and  vr  >  0,  we  have  that 

Rh  =  R°o-  R°r  <  R°0. 

Vr 

Note  that  if  Pjr  is  a  column  removed  from  /i°,  then  we  have 
RlPjr  =  R°0P:,  -  ^  R°rPjr  =  0  -  -°  <  0. 

VT  vr 

We  also  have 

RlPj  =  0 

for  O'  =  ji,j2)  .  .  . ,  s, .  .  . ,  jm). 

Thus  at  least  m  +  1  columns  of  the  m  +  n  columns  now  have  the  property 
RqP]  <  0.  This  compares  with  at  least  m  columns  having  this  property 
in  the  previous  stage.  Further,  on  the  next  iteration  this  P,-r  cannot  return 
to  a  basis  since  a  candidate  s  for  the  basis  must  satisfy  RoP„  >  0.  This 
completes  the  verification. 

Example  8.  A  Guessing  Game. 

Let  us  apply  the  computational  method  to  the  following  game:  Blue 
secretly  picks  one  of  three  numbers  1,  2,  3.  Red  then  proceeds  to  guess  the 
number  picked  by  announcing  his  guess.  Each  time  Red  announces  his 
guess,  Blue  answers  “high,”  “low,”  or  “correct,”  as  the  case  may  be. 
The  game  continues  until  Red  has  guessed  correctly.  The  payoff  to  Blue 
is  the  number  of  guesses  required  by  Red  to  identify  the  number. 

A  strategy  for  Blue  is  the  choice  of  a  number  1,  2,  or  3.  A  strategy  for 
Red  may  be  represented  by  a  triplet  ( G ;  H,  L)  where  G  is  the  number 
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guessed  the  first  round,  H  is  the  number  guessed  the  second  round  if  Red 
hears  “high,”  and  L  is  the  number  guessed  the  second  round  if  Red  hears 
“low.”  It  is  clear  that  the  game  can  be  terminated  in  two  rounds.  Thus 
Red  has  five  strategies, 

(1;  0,  2),  (1;  0,  3),  (2;  1,  3),  (3;  1,  0),  (3;  2,  0), 
where  0  means  “not  applicable.” 

The  payoff  to  Blue  is  the  number  of  guesses  required  by  Red  to  get  a 
“correct”  response  from  Blue.  The  payoff  matrix  is  the  following: 

Guessing  Game  Payoff 


Blue’s 

choices 


Red’s  choices 

(102)  (103)  (213)  (310)  (320) 

1  r  1  1  2  2  3 

2  2  3  1  3  2 

3  3  2  2  1  1 


The  augmented  matrix  associated  with  this  game  is  given  below: 


Po 


G  = 


Pt  P2 
1  1 
1  1 

2  3 

3  2 


P3  Pi 
1  1 
2  2 

1  3 

2  1 


P6  Pe 
1  0 
3  1 

2  0 
1  0 


Pi  Ps 
0  0  ~ 
0  0 
1  0 
0  1 


First  iteration.  The  initial  basis  B°  consists  of  (P0,  Pi,  Pt,  Pt),  or 

r  o  i  o  on 


-1  3  0  0. 


The  inverse  of  B°,  or  (B0)-1,  is  computed  from  (2.15) : 


“3 

0 

0 

-r 

”Bo” 

1 

0 

0 

0 

B? 

2 

1 

0 

-1 

R°2 

_1 

0 

1 

-i_ 

_BL 

Using  this  inverse,  we  compute  BoP,-  for  all  j  ^  0.  We  find  that 

max  BoP,  =  B0P4  =  2. 

j  3^0 


Therefore  P4  replaces  some  column  in  B°.  To  determine  the  column  to  be 
dropped,  we  first  compute  the  vector  V  =  (v,)  where  v,  —  B?P4. 
Substituting,  we  get 

Vo  —  2,  Vi  =  1,  vz  =  3,  v3  =  3. 
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Next  we  obtain 


.  R? 

min  — 

Vi  >0  Vi 
1*0 


3  ’ 


hence  Pi  is  dropped  from  the  basis. 
Second  iteration.  The  next  basis  Bl  is 


To  get  its  inverse  we  set 


We  then  get 


B1  =  (P0,  Ph  Pe,  P4). 


pi  _  po  po 

ill  —  ill  —  iCa 

Vs 

pi  „  J_  po 
il3  —  il3- 

Vz 


for  i  ^  3, 


(P1)-1  = 


-R'0~ 

r7  n  —  2  _1“| 

^  U  3  3 

Rl 

2  A  _  1  1 

3  u  3  3 

Rl 

11-1  0 

_PL 

in  i  _  i 

l_3  u  3  3-J 

After  computing  PjP,  for  all  j  ^  0  we  find  that 

max  RlPj  =  RlPz  =  1. 

3*0 

Therefore  P3  replaces  some  column  in  B1.  The  new  vector  V  can  now  be 
computed.  Its  components  are 

v0  =  1,  Vx  =  1,  v2  =  2,  Vz  =  0. 

Using  these  values,  we  find 

Rl 


.  Rl 

min  —  =  -r-. 
p(>o  Vi  2 

i  5^0 


Hence  P6  is  dropped  from  BK 

Third  iteration.  We  have  B2  =  (P0,  Pi,  P3,  P4).  The  inverse  of  P2 

will  be 


(P2)'1 

Using  this  inverse,  we  compute  that 


“11 

“6" 

_ 3 

1 

6 

—  2  “1 

1 

_ 3 

1 

2 

6 

6 

3 

¥ 

3 

_ 3 

¥ 

0 

2 

L  ^ 

0 

2 

6 

_ 2 

6— J 

max  RlPj  =  RlPz  —  J, 

jV  0 


%  =  I, 


—  ■§> 


vz  = 


v3— 


p  2  p2 

ill  /Cl 


mm  —  = 

u*>0  V{ 
i?*  0 


THE  FUNDAMENTAL  THEOREM  35 


Fourth  iteration.  We  have  B 3  =  (P0,  P2,  P3,  Pi)  and 


We  find  that 

RlPj  <  0  for  all  j  ^  0. 


Therefore  a  solution  to  the  game  has  been  obtained.  The  optimal  strategies 
are  X*  —  (f,  f),  obtained  from  the  top  row  of  (P8)-1  and  F*  =  (0, f, 

i,  0),  obtained  from  the  first  column  of  (P3)-1.  The  value  of  the  game, 
obtained  from  the  upper  left  corner,  is  f . 


3  PROPERTIES  OF 

OPTIMAL  STRATEGIES 


t.  MANY  OPTIMAL  STRATEGIES 

From  the  minimax  theorem  it  follows  that  every  finite  game  has  a  so¬ 
lution  in  mixed  strategies.  In  some  cases,  depending  on  the  payoff  matrix, 
the  game  may  have  many  solutions.  In  this  chapter  we  study  the  properties 
of  this  set  of  solutions.  We  shall  first  examine  some  properties  of  an  op¬ 
timal  strategy  and  then  analyze  the  properties  of  the  set  of  optimal 
strategies. 

2.  SOME  PROPERTIES  OF  AN  OPTIMAL  STRATEGY 

Suppose  X*  —  (x\)  and  F*  =  (?/*)  are  optimal  strategies  of  Blue  and 
Red,  respectively.  If  x\  >  0,  then  strategy  i  may  be  played,  depending  on 
the  outcome  of  the  randomization.  If  x\  —  0,  then  strategy  i  is  not  played. 

Let 


~u\  ~ 

rq 

ul 

n 

U *  =  AY*  = 

,  L*  =  A'X*  = 

* 

m  _ 

JL 

Thus 

U*  =  (w*)  =  ( S  any)) 
i-i 

is  a  vector  whose  components  represent  Blue’s  expectation  if  he  uses  a  pure 
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strategy  i  and  Red  uses  an  optimal  strategy  Y*.  Similarly,  the  components 
of 


m 

L*  =  (fj)  =  ( s  diA) 

t  =  l 


represent  Blue’s  expectation  if  he  uses  an  optimal  mixed  strategy  X*  and 
Red  uses  a  pure  strategy  j. 

From  the  minimax  theorem  we  have 


(3.1)  max  2  any)  —  min  2  a-iA  =  v- 

i  3  =  1  J  *  =  1 

Hence  we  have 

(i)  max  u\  =  min  Tj  =  v.  There  exists  at  least  one  pure  strategy  for 

ij 

each  player  which,  if  used  against  his  opponent’s  optimal  mixed 
strategy,  yields  the  value  of  the  game. 

From  (3.1)  we  also  get 

(ii)  u\  <v  <  1]  for  all  i  and  j.  Against  an  opponent’s  optimal  mixed 
strategy  a  pure  strategy  cannot  yield  a  higher  expected  payoff  than 
his  optimal  mixed  strategy. 

Since  ut  <  v  for  all  i,  let  us  designate  by  Si  the  set  of  Blue  strategies 
for  which  u\  <  v,  and  by  S2  the  set  of  Blue  strategies  for  which  u\  =  v. 
Then,  by  the  minimax  theorem,  we  have 

TO  ^  * 

v  =  S  u\x\  =  v  S  +  2  uixi 
«  =  1  S,  Si 

or  «>(1  —  2  aS)  =  2  ««*«• 

Ss 

Now  from  the  definition  of  Si  and  S 2,  it  follows  that 

1-24  =  2  x*4. 

Si  Si 


Hence  we  get 

v  2  z4  =  2  uWt 

Si  Si 

or 

(3.2)  2  (v  -  ut)xt  =  0. 

Si 

Since  v  —  >  0  for  every  i  in  Si,  it  follows  from  (3.2)  that  x\  =  0  for 

every  i  in  Si.  We  have  shown  that 

(iii)  If  u\  <  v,  then  x\  =  0.  A  player’s  optimal  mixed  strategy  con¬ 
tains  no  pure  strategy  which  yields  less  than  the  value  of  the  game 
when  that  pure  strategy  is  used  against  an  opponent’s  optimal 
mixed  strategy. 
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Since  x\  >  0  for  all  i,  we  can  express  the  game  value  v  as  follows: 


m 

v  =  S  utx*i  =  S  u\x\ 

i  =  l  xt*  >0 

S*  *  i  «  ♦  ♦ 

UiXt  +  2  UtXi. 

Xi*  >0  x,*  >0 

Ui*=»  «*.■*<» 

Hence 

v  —  v  2  x\  +  2  UiX*t, 

Ri  Ri 

where  Ri  represents  the  set  of  Blue  strategies  for  which  x*  >  0  and  u\  <  v, 
and  Ri  is  the  set  of  Blue  strategies  for  which  x\  >  0  and  u\  =  v.  But  since 

2  x\  +  2  =  1, 

Ri  R2 

we  get 

v  2  xt  =  2  utXi. 

R\  Ri 


Therefore 

2  (v  —  u^x]  =  0. 

Ri 

Since  we  have  assumed  x\  >  0,  and  v  —  u\  >  0  for  each  i  in  Rh  it 
follows  that  Ri  is  vacuous.  We  have 

(iv)  If  x\  >  0  then  u\  —  v.  If  a  pure  strategy  is  a  member  of  an  opti¬ 
mal  mixed  strategy  it  yields  the  value  of  the  game  when  used 
against  an  opponent’s  optimal  mixed  strategy. 

Example.  For  the  Colonel  Blotto  game  (Example  4)  we  have  the  payoff 
matrix 


A  = 


0  2  1“ 
4  1  2 

-13  0, 
1  0  3 
-2  2  2 


for  which  the  value  of  the  game  is  v  —  to  Blotto.  The  following  vectors 
can  be  readily  verified. 


Note  that  ul  =  Jg2-  <  -1/-  =  v  and  x\  =  0;  also,  u\  =  J92-  <  -t,4-  =  v  and 
xt  =  0. 
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3.  CONVEX  SET  OF  OPTIMAL  STRATEGIES 

Suppose  a  player  has  more  than  one  optimal  strategy.  Let  x*  and  x\  be 
two  optimal  mixed  strategies  of  Blue.  If  v  is  the  value  of  the  game,  then 
from  the  property  of  an  optimal  strategy 

AT  AY  >  v  and  Xt'AY  >  v  for  all  F. 

Let  X  be  an  arbitrary  number  such  that  0  <  X  <  1.  Then  we  have 

XXfAF  +  (1  -  \)X l' AY  >  v  for  all  F. 

Thus 

(3.3)  [XXf  +  (1  -  \)XZ]AY  >  v  for  all  F. 

Hence,  for  every  value  of  X,  the  mixed  strategy  [XAf  +  (1  —  X)XI],  which 
is  a  convex  linear  combination  of  X*  and  X|,  is  also  an  optimal  strategy 
for  Blue.  Since  X  was  arbitrary,  it  follows  that  if  a  player  has  more  than 
one  optimal  strategy,  he  has  an  infinite  number  of  optimal  strategies.  This 
infinite  set  of  optimal  strategies  satisfying  (3.3)  is  called  a  convex  set.  A 
similar  argument  may  be  applied  to  Red’s  strategies. 

Let  Ti  be  the  set  of  optimal  strategies  for  Blue  and  Tz  the  set  of  opti¬ 
mal  strategies  for  Red.  Then  T\  and  Tt  are  convex  sets.  It  follows  that 
every  member  of  T%  is  a  convex  linear  combination  of  certain  points  K{Ti) 
of  the  set  7\.  Thus  we  may  describe  T\  by  describing  all  members  of  K(Tf). 
It  will  turn  out  that  K(Tf)  is  a  finite  set. 

4.  OPERATIONS  ON  GAMES 

There  are  several  operations  one  may  perform  on  the  payoff  matrix 
without  altering  the  set  of  optimal  strategies  of  a  game.  These  operations 
are  sometimes  useful  in  solving  games  with  a  large  number  of  strategies. 

Permuting.  A  permutation  of  the  rows  or  columns  of  a  payoff  matrix 
permutes  the  components  of  the  solution  and  does  not  alter  the  value  of 
the  game.  We  can  also  permute  the  players  by  solving  the  game  whose 
payoff  matrix  is  the  negative  transpose,  —A',  of  the  payoff  matrix  A. 

Addition  of  and  multiplication  by  constant.  If  a  constant,  c,  is  added  to 
each  element  of  the  payoff  matrix  which  has  a  game  value,  v,  then  the 
value  of  the  new  game  is  v  +  c.  It  is  clear  that  the  sets  of  solutions  are 
the  same  for  both  games. 

If  every  element  of  A  is  multiplied  by  a  positive  number  c,  then  the 
new  game  has  a  value  cv.  Again,  the  set  of  solutions  is  unchanged. 
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5.  DOMINATED  STRATEGIES 

Let  us  define  a  poor  strategy  of  a  player  as  some  pure  strategy  which 
appears  with  zero  probability  in  every  optimal  mixed  strategy  of  that 
player.  That  is,  we  shall  say  Blue’s  fcth  pure  strategy  is  poor  if  we  have 
xt  —  0  in  every  Blue  optimal  mixed  strategy  X*  —  (x<). 

If  T\  is  Blue’s  set  of  optimal  mixed  strategies,  then  k  is  a  poor  strategy 
if  for  every  member  of  T\  we  have  xt  =  0.  Since  a  poor  strategy  never  ap¬ 
pears  with  positive  probability  in  any  optimal  mixed  strategy,  it  will  never 
be  played.  Knowing  the  poor  strategies,  if  any,  we  can  reduce  the  size  of 
the  game  by  the  number  of  poor  strategies. 

One  method  of  examining  a  game  for  poor  strategies  is  to  examine  the 
payoff  matrix  for  dominances.  Suppose  that  the  payoff  matrix  A  —  (a<,) 
is  such  that 

(3.4)  aij  >  akj  forj  =  1, 2, .  .  . ,  n, 

or  that  row  i  dominates  strictly  row  k.  Then  we  say  that  strategy  k  of 
Blue  is  dominated  by  strategy  i,  or  k  is  a  dominated  strategy.  However, 
for  Red,  if  column  r  dominates  strictly  column  s,  i.e.,  if 

(3.5)  air  >  ait  for  i  =  1,  2, .  .  . ,  m, 

then  strategy  r  is  dominated  by  strategy  s. 

Let  us  assume  that  for  Blue  strategy  k  is  dominated  by  strategy  i,  or 

akj  <  an  forj  =  1,2,  ,  n. 

Then  if  Y*  =  (y*)  is  an  optimal  strategy  for  Red,  we  have 

n  n 

ut  =  S  akjy*  <  2  a^y*  =  u*t  <  v, 
i- 1  i  =  l 

or  ut  <  v. 

It  follows  that  for  any  X*  in  T\, 

xt  =  0. 

Hence  strategy  k  is  a  poor  strategy.  Similarly,  we  can  show  that  if  (3.5) 
is  satisfied,  then  strategy  r  is  a  poor  strategy  for  Red.  We  have  thus 
shown  that 

(i)  If  a  strategy  is  dominated  by  another  strategy,  the  dominated 
strategy  is  a  poor  strategy. 

Suppose  that  there  exist  pure  strategies  k,  l,  and  r  and  a  number 
X(0  <  X  <  1)  such  that 

(3.6)  ak,  <  \aij  -(-  (1  —  X)arj  for  j  =  \,  2, ...  ,n. 


PROPERTIES  OF  OPTIMAL  STRATEGIES  41 

That  is,  suppose  strategy  h  is  dominated  by  a  convex  linear  combination 
of  strategies  l  and  r.  Then  for  any  Y*  we  get 

ut  =  2 akjy*  <  \2ai,y*  +  (1  —  X)  2 ar/y*  =  Xu*  +  (1  —  X)u*. 

But  u*  <  v  and  u*  <  v. 

Hence  ut  <  Xv  +  (1  —  X)v  =  v. 

Again,  this  implies  that  for  any  X*  in  Ti, 

xt  =  o, 

or  the  kth  strategy  is  a  poor  strategy.  We  have  thus  shown  that 

(ii)  If  a  strategy  is  dominated  by  a  convex  linear  combination  of 
other  strategies,  the  dominated  strategy  is  a  poor  strategy. 

Now  suppose  a  convex  linear  combination  of  strategies  fc  and  l  is  dom¬ 
inated  by  a  convex  linear  combination  of  strategies  r  and  s — i.e.,  suppose 
there  exist  X  and  m  where  0  <  X  <  1  and  0  <  m  <  1,  such  that 

(3.7)  Xaicj  +  (1  —  X)dij  <  yaTj  +  (1  —  u) a*,-  for  j  =  1,  2, ...  ,  n. 
Multiplying  (3.7)  by  y*  and  summing  over  all  j,  we  get 

X  S  akjy*  +  (1  -  X)  S  ahy*j  <ft2  arjy*  +  (1  -  X)  2  a./y). 

Hence 

(3.8)  Xut  +  (1  -  X)ut  <  M«*  +  (1  -  mK  <  v. 

Now  ut  <  v  and  u*  <  v.  If  ut  —  v  and  u *  =  v,  then  from  (3.8)  we  get  the 
contradiction  v  <  v.  Hence  either  ut  <  v  or  u*  <  v.  Therefore,  either 
xt  =  0  or  xf  =  0.  We  have  shown  that 

(iii)  If  a  convex  linear  combination  of  strategies  is  dominated  by  a 
convex  linear  combination  of  other  strategies,  then  there  exists 
at  least  one  poor  strategy  among  the  dominated  convex  combina¬ 
tion  of  strategies. 

Finally,  suppose  that  the  payoff  matrix  A  can  be  decomposed  into  four 
submatrices  A i,  A2,  As,  A4  as  follows: 


where  the  sub  matrices  A2  and  A3  have  the  following  properties: 


(3.9) 


(a)  Each  column  of  A2  dominates  strictly  some  column  of  A^ ' 

(b)  Each  row  of  A 3  is  dominated  strictly  by  some  row  of  Ai. 


Let  X*,  FI  be  a  solution  of  the  game  having  Ai  as  the  payoff  matrix.  Then 
it  is  readily  verified  that  the  pair  of  full  vectors  X*  —  (X*,  0, .  .  . ,  0), 
Y*  —  (Fj,  0,  .  .  .  ,  0)  is  a  solution  of  the  original  game  A.  Thus  Blue  has 


42  PROPERTIES  OF  OPTIMAL  STRATEGIES 

optimal  mixed  strategies  which  mix  only  those  strategies  associated  with 
A\,  and  Red  has  similar  optimal  strategies.  We  shall  now  show  that  these 
are  the  only  optimal  strategies.  If  Blue  has  an  optimal  mixed  strategy 
which  contains  a  pure  strategy  k  determining  A3,  then  xt  >  0.  Therefore 
for  any  optimal  Red  strategy  Y*, 

ut  =  v. 

But  if  Y*  =  (F*,  0, .  .  . ,  0),  it  follows  from  the  dominance  assumption 
that 

ut  <  v, 

a  contradiction  to  the  optimality  assumption.  Therefore,  in  every  optimal 
strategy,  xt  =  0  for  all  k  determining  A3.  Thus  all  Blue’s  strategies  which 
determine  A3  are  poor  strategies.  Similarly  the  Red  strategies  which  deter¬ 
mine  A  2  are  poor  strategies.  It  is  interesting  to  note  that  these  conclusions 
are  independent  of  A4.  We  have  shown  that 

(iv)  If  the  payoff  matrix  A  can  be  decomposed  in  such  a  way  that 
(3.9)  is  satisfied,  then  all  the  solutions  of  A  are  obtained  by  solving 
A  i.  The  remaining  strategies  are  poor  strategies. 

6.  ALL  STRATEGIES  ACTIVE 

We  have  seen  that  for  any  game  the  expectation  of  Blue  is  at  most  v,  the 
value  of  the  game,  if  he  uses  a  pure  strategy  and  Red  uses  an  optimal  mixed 
strategy.  Similarly,  the  least  that  Red  can  expect  to  pay  Blue  is  v  if  Red 
uses  a  pure  strategy  and  Blue  uses  an  optimal  mixed  strategy.  Further, 
each  player  has  at  least  one  pure  strategy,  which,  if  used  against  an  oppo¬ 
nent’s  optimal  mixed  strategy,  yields  exactly  v  to  Blue.  Now  it  may  hap¬ 
pen  that  the  game  matrix  is  such  that  every  pure  strategy,  if  used  against 
an  opponent’s  optimal  mixed  strategy,  yields  exactly  v  to  Blue.  In  such  a 
case  we  say  all  strategies  are  active  in  this  game.  This  notion  will  be  useful 
for  describing  the  extreme  points  of  the  convex  set  of  optimal  strategies. 

Define  Jn  to  be  a  row  vector  which  has  n  elements,  each  of  which  is  1 ; 
thus 

J<  =  (1,  1,  1,  1). 

Now,  if  X*,  Y*  is  a  solution  of  the  game  having  an«X«  payoff  matrix 
A,  then  all  strategies  are  active  if 

(3.10)  U*  —  AY*  =  vJ'n  and  L*  =  A'X*  =  vJ'n. 

Suppose  the  payoff  matrix  is?iX»  and  nonsingular.  If  all  strategies 
are  active  in  this  game,  then  the  solution  is  readily  obtained.  Using  (3.10) 
and  solving  for  X*  and  Y*  we  have 

(3.11)  X*  =  v{A')~lJn, 


Y*  =  vA-'Jl. 
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Since  X*  and  Y*  are  probability  distributions,  we  have 

JnX*  =  JnY*  =  1. 

Substituting  in  (3.11)  we  get 

1  1 


v  = 


(3.12) 


X*  = 


{A')-xJ'n 


JnA~'J'n’ 
A~lJ'n 


Y*  = 


Jn{A')-lJn  ’  ‘  JnA~lJn 

Thus,  if  all  strategies  are  active  in  a  square  nonsingular  payoff  matrix, 
the  solution  of  the  game  can  be  obtained  from  (3.12).  Conversely,  if  the 
components  of  X*  and  F*  in  (3.12)  are  non-negative,  then  X*,  Y*  is  a 
solution  of  the  game. 

Now  if  A  is  an  n  X  n  nonsingular  matrix,  then 


DA-1  = 


An 

An 


-A  21 

An 


An  i 
An  2 


=  H , 


where  D  is  the  determinant  of  A  and  A  a  is  the  cofactor  of  the  element  aiS 
in  A.  Recall  that  II  =  DA~l  is  the  adjoint  of  A. 

We  can  express  (3.12)  in  terms  of  the  adjoint  matrix  as  follows: 


v  - 


determinant  of  A 


D  =  _ 

S  An  sum  of  all  elements  of  adjoint  matrix’ 
i.i 


_  izl 
x(  ~ 


2  A,- 


2  Aj 
i,3 


row-sum  of  elements  of  adjoint  matrix 
sum  of  all  elements  of  adjoint  matrix 


y)  = 


2  Aji 

1  =1 

2  A^ 

IJ 


column-sum  of  elements  of  adjoint  matrix 
sum  of  all  elements  of  adjoint  matrix 


7.  OPTIMAL  STRATEGIES  AS  EXTREME  POINTS 

We  have  shown  that  if  the  payoff  matrix  is  nonsingular  and  all  strat¬ 
egies  are  active,  then  the  optimal  strategies  X*,  Y*  and  value  of  game  are 
given  by  (3.12).  Hence  X*  is  an  element  of  T\  and  Y*  is  an  element  of  T-i, 
where  Ti  and  T-2  are  the  convex  sets  of  optimal  strategies  of  Blue  and  Red, 
respectively.  However,  the  X*  and  F*  given  by  (3.12)  are  special  points 
of  Ti  and  T2.  We  shall  show  that  they  are  extreme  points  of  the  convex 
sets  T i  and  T2,  respectively. 
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Suppose  that  X*,  which  is  given  by  (3.12),  is  not  an  extreme  point  of 
T\.  Then  we  can  write 

X*  =  XXi  +  (1  -  X)X2, 

where  Xi  and  X2  are  extreme  elements  of  T,.  Since  Xi  and  X2  are  optimal, 
it  follows  that 

(3.13)  A’X j  >  vJ'n, 

(3.14)  A'X,  >  vJ'n. 

It  follows  that 

\A’X,  +  (1  -  X),4'X2  =  ^'[XXx  +  (1  -  X)X2]  =  A'X*  =  vJL 

From  (3.13)  we  have  that  each  component  of  the  vector  A'X i  is  larger 
than  or  equal  to  v.  Suppose  that  some  component  of  Xi  were  actually 
larger  than  v ;  then  correspondingly  A'X *  >  v  for  the  same  component. 
This  is  impossible  since  we  have  assumed  A'X*  =  vJ'n.  Therefore 
A'X,  =  vJl  Similarly,  A'X,  =  vJ'n. 

We  have  thus  shown  that 

A'{X,  -  X2)  =  0,7'. 

Since  A'  is  nonsingular,  this  implies  X,  =  X2.  Hence 
x*  =  XXi  +  (1  -  X)X2  =  Xi, 
or  X*  is  an  extreme  point  of  T,. 


8.  EXTREME  POINT  WHICH  YIELDS  SUBMATRIX 


Suppose  the  payoff  matrix  A  is  such  that  there  exists  a  nonsingular 
r  X  r  submatrix  B  for  which  all  strategies  are  active.  Then  for  the  sub¬ 
game  B  the  game  value,  vT,  and  optimal  strategies,  X*,  Y‘,  are  given  by 


Vr 


_J _ 

JrB-V'r  ’ 


(3.15) 


X r  = 


JrB-1 

JrB-V'r 


_  JAB-1)' 
JrB-lJ'T  ' 


Since  the  problem  of  solving  a  game  is  not  affected  by  interchanging  rows 
or  columns,  we  can  suppose  without  loss  of  generality  that  B  is  situated  in 
the  upper  left-hand  corner  of  A,  i.e., 


B  = 


o-n  On 


Olr 


Larl  Or, 
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Let  X*'  —  (x*,  x*,  ,  x*)  be  an  optimal  strategy  of  B  which  is  given 

by  (3.15).  Now  suppose  that 

X*'  =  (a$,  *5, ....  0, 0, ,  0) 

is  an  optimal  strategy  in  the  game  with  payoff  matrix  A.  We  shall  show 
that  X*  is  an  extreme  point  of  Ti.  For  if  we  suppose  the  contrary,  then 
there  are  distinct  strategies  U  =  (u1;  .  .  . ,  um)  and  W  —  (wh  .  .  .  ,  wm)  of 
T\  such  that 


x*  =  \u  +  (1  -  X)TF, 

0  <  X  <  1. 

In  terms  of  the  components  of  the  vectors,  we  have 

(3.16)  x*  =  \Ui  +  (1  —  \)wi, 

But 

*  =  1,  2, .  . 

. ,  Wl. 

1! 

© 

«s>. 

II 

+  1,  r  +  2,  .  . 

. ,  m. 

It  follows  that 

Ui  =  Wi  —  0,  i  —  r 

+  1,  r  +  2,  .  . 

. ,  m. 

Therefore 

r  m 

t=l  t-1 

3  =  1,  2,  • 

.  .  ,n. 

Since  U  is  optimal,  we  have 

m 

2  dijUi  >  V, 
i  =  l 

i  =  1,  2,  •  . 

■  ■  ,n. 

And  hence 

(3.17)  2  ciijUi  >  v , 

i  =  i.  2,  . . 

,  .  ,n. 

Similarly,  we  conclude  that 

CO 

I— ‘ 
00 

"a 

| 

IV 

7  =  1,2,.. 

From  (3.15)  we  have 

X’B  - 

r 

hence  2  a^x*  =  v, 

i  =  1 

3  =  1,  2, . 

.  .  ,r. 

Making  use  of  (3.16),  we  have 

r  r 

(3.19)  X  2  aijUi  +  (1  —  X)  2  =  v, 

i  — 1  i  —  1 

3  =  1,  2,  . 

.  .,r. 

From  (3.17)  and  (3.18)  and  using  (3.19),  it  follows  that 

r  r 

2  aijUi  =  2  =  v, 

1  i  =  l 

j  =1,2,. 

.  ,,r. 

Hence 


U'rB  =  W'B, 
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which  implies 

(U'r  -  W'r)B  =  0. 

Since  we  have  supposed  that  U  and  W  are  distinct  vectors,  it  follows  that 
B  is  a  singular  matrix,  contrary  to  hypothesis. 

We  have  therefore  shown  that  an  optimal  strategy  associated  with  a 
nonsingular  submatrix  is  an  extreme  point  of  T\. 


9.  SUBMATRIX  WHICH  YIELDS  EXTREME  POINTS 


We  shall  now  show  that  if  X*  and  Y*  are  extreme  points  of  Ti  and  T2, 
respectively,  then  there  exists  a  nonsingular  submatrix  B  of  A  such  that 


v 


1 

-  .JrB-'J'r’ 


(3.20)  xr  =  Jgg 

MB-')' 

JrB-Kl’r 

where  r  is  the  order  of  B,  X*  is  the  vector  obtained  from  X*  by  deleting 
elements  corresponding  to  the  rows  deleted  to  obtain  B  from  A,  and 
similarly  for  Y*. 

To  prove  this  we  shall  take  a  pair  of  extreme  points,  X*,  Y*,  and  con¬ 
struct  a  nonsingular  submatrix  B  satisfying  (3.20).  For  convenience,  we 
shall  assume  that  v  0. 

First,  relabel  the  rows  and  columns  of  A,  by  using  the  components  of 
X*  and  Y*,  as  follows: 

Xf  >  0,  1  <  i  <  m'; 

x*i  =  0,  ml  +  1  <  i  <  m; 

Vi  >0,  1  <  3  <  n'; 

Y)  =  0,  n'  +  1  <  j  <  n. 

Now  arrange  the  rows  and  columns  corresponding  to  x*t  =  0  and  y)  =  0 
as  follows: 

n  ■ 

S  cn3y*j  =  v,  m!  +  1  <  i  <  m"; 

j  =  l 
n 

S  aids  <v>  m"  +  1  <  i  <  m; 

j=l 


m 


2 

i  =  1 


atp:\  —  v, 


n'+l<j<  n"; 


m 

S  ad  >  v,  n"  +  1  <  j  <  n. 

i  =  1 

Define  the  vectors  Dj  and  C,  as  follows: 

Dj  —  (ctiy,  a2jj  .  .  . ,  arn’j) ,  1  3  —  /a  ? 

Ci  =  (aa,  ai2,  .  .  . ,  Oi„'),  1  <  i  <  m". 
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Now  arrange  the  columns  of  the  matrix  for  which  n'  +  1  <  j  <  n"  in 
such  a  way  that  D„>+ 1,  Dn’+ 2,  .  .  . ,  Dt  are  linearly  independent  of 
Di,  Da,  .  .  .  ,Dt  and  Dt+h  .  .  .  ,  Dn"  are  linearly  dependent  on  Dh 
Da,  ... ,  Dt.  That  is,  we  find  some  number  t  such  that  n'  +  1  <  t  <  n" 
and  such  that  there  do  not  exist  numbers  Xi,  X2, .  .  . ,  Xt  satisfying 

Dj  =  X1D1  +  X2D2  +  .  .  .  +  Xj_iZ)y_i  +  Xj+xDy+i  +  .  .  .  +  XtD(, 

n'  +  1  <  j  <  t, 

and  there  do  exist  numbers  Xi,  X2,  .  .  . ,  X(  satisfying 

Dj  —  X1D1  +  X2D2  +  .  .  •  +  \J),,  t  +  1  <  j  <  n”. 

Similarly,  arrange  the  rows  of  the  matrix  for  which  m'  +  1  <  i  <  m" 
in  such  a  way  that  CV+i,  CV+ 2, .  .  .  ,CS  are  linearly  independent  of 
Ci,  Ct,  ...  ,C,  and  C,+i,  C>+2,  .  .  . ,  Cm"  are  linearly  dependent  on 
Ci,  Ct,  ...  ,C,  where  m'  +  1  <  s  <  m".  That  is,  we  can  find  a  number  s 
such  that  if  m'  +  1  <  i  <  s,  there  do  not  exist  numbers  m,  ft  a, ns 
so  that 

(3.21)  Ci  =  ftiCi  +  H2C2  +  .  .  .  +  /ti_iC,_i  +  Mi+iCi+i  +  .  .  .  +  MsCs, 
and  if  s  +  1  <  i  <  m",  there  do  exist  m,  j u2,  ...»  Ms  such  that 

(3.22)  Ci  =  M1C1  +  ft2C2  +  .  .  .  +  m«C8. 

We  may  represent  the  preceding  decomposition  of  the  m  X  n  matrix  as 
shown  in  Fig.  5. 


Figure  5 
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We  shall  now  show  that  the  sX(  matrix 


is  nonsingular.  Define  the  vectors 

.Z?i  (a*i,  .  .  .  ,  (lit))  1  ^  i  ^  s. 

Then,  if  B  is  singular,  we  must  have  dependence  among  rows  or  columns. 
Let  us  assume  the  dependence  among  rows.  Then  there  exist  constants  c<, 
not  all  zero,  such  that 

(3.23)  ciBi  +  C2B2  +  .  .  .  +  caBa  =  0. 

Suppose  c,  5^  0  for  some  i  where  m!  +  1  <  i  <  s.  Then  we  may  divide 
(3.23)  by  Ci  and  write 

Bi  =  ct\B\  +  a2B2  +  .  .  .  +  +  ai+iBi+i  +  .  .  .  +  ot„Ba. 

In  particular,  we  have 

Ci  —  aiCi  +  0C2C2  +  .  .  .  +  asCs, 

which  contradicts  the  assumption  of  independence  of  C,  for  m!  +  1  <  i  <  s. 
Hence  c<  =  0  for  m'  +  1  <  i  <  $. 

Thus  (3.23)  becomes 

C\B\  +  C2B2  +  .  .  .  +  cm’Bm'  =  0, 

m! 

or  S  aan  =  0,  1  <  j  <  t. 

1  =  1 

Now,  since  n'  +  1  <  t  <  n"  we  have 

t 

S  any)  =  v  for  1  <  i  <  m'. 

j=i 

Multiplying  by  c„  we  get 

t 

Ci  S  di/y)  =  dv. 

Summing  over  i,  we  have 

t  m'  mf 

2  2  ctUijy)  =  v  S  Ci, 

j  =  U  =  l  i  =  l 

m ' 

or  0  =  v  S  Ci. 

>=1 

Since  v  0,  it  follows  that 

m' 

S  Ci  =  0. 

i  =  I 


(3.24) 
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Now  define  the  following  m  component  vectors, 

Xe  =  ( x\  +  «Ci,  x2  +  «c2,  .  .  .  ,  V  +  eCm',  0,  .  .  .  ,  0), 

X—  f  —  (Xi  fCi,  X2  €C2,  .  .  .  ,  Xm'  f&m' ,  11,  -  ■  -  ,  11)) 

where  e  is  arbitrary  and  c<  satisfies  (3.24).  Then  we  can  find  an  e  such  that 

xi  +  eCj  >  0,  1  <  i  <  j»'; 

xt  —  tCi  >0,  1  <  i  < 

Hence 

m  mm  m' 

2  (x*4  +  tCi)  =  2  x*i  +  e  S  Ci  =  1  +  €  S  ct-  =  1, 

i=l  *  i=l  1=1  i—l 

m  m' 

S  (x\  —  tCi)  =  1  —  e  S  c<  =  1. 

l—l  t=l 

Therefore  Xe  and  X_e  are  mixed  strategies  for  Blue. 

Now  for  all  j,  we  have 

m  m 

2  a,n(x\  ±  eCi)  =  2  a.jx)  >  p. 
i=i  «=i 

Therefore  Xe  and  X_e  are  optimal  strategies  for  Blue.  But 

X*  =  ux<  +  X_e), 

which  contradicts  the  assumption  that  X*  is  an  extreme  point  of  TV  Hence 
B  is  nonsingular.  This  implies  that  B  is  a  square  matrix,  or  s  =  t.  Further, 
it  is  clear  from  our  construction  of  s  and  t  that  for  the  B  matrix  all  strat¬ 
egies  are  active.  Therefore,  (3.20)  will  hold  for  the  B  matrix. 

10.  DETERMINING  THE  SETS  OF  OPTIMAL  STRATEGIES 

The  sets  of  optimal  strategies,  T\  for  Blue  and  T2  for  Red,  are  convex 
sets.  Hence  they  are  determined  by  the  extreme  points  of  the  sets.  Now 
for  each  pair  of  extreme  points,  one  from  T\  and  one  from  T2,  there  is  a 
nonsingular  submatrix  whose  solution  has  all  strategies  active.  Further, 
every  nonsingular  submatrix,  whose  solution  has  all  strategies  active  and 
which  is  also  a  solution  of  the  original  matrix,  determines  a  pair  of  extreme 
points  of  Ti  and  T2.  Since  there  are  only  a  finite  number  of  submatrices 
in  each  matrix,  it  follows  that  there  are  only  a  finite  number  of  extreme 
points  of  Ti  and  T2.  Therefore  the  sets  Ti  and  T2  are  polyhedral  sets 
spanned  by  a  finite  number  of  vertices. 

We  can  find  all  the  extreme  points  of  T\  and  T2  by  examining  all  the 
submatrices  of  the  payoff  matrix. 
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Example.  Let  us  determine  the  sets  of  optimal  strategies  T\  and  T2  for 
the  following  game : 


3" 

2 

3 


First,  we  test  for  strict  dominance  to  see  if  there  are  any  poor  strategies 
in  this  game.  We  find  no  strict  dominance,  although  there  are  several 
instances  of  nonstrict  dominance. 

To  obtain  all  the  extreme  points  of  Ti  and  T2  we  examine  all  square 
submatrices  that  can  be  formed.  There  are  nine  1X1  submatrices,  nine 
2X2  submatrices,  and  one  3X3  matrix.  Examining  1  X  1  submatrices  is 
equivalent  to  testing  A  for  saddle-points.  We  find  a  saddle-point  at  i  —  1, 
j  —  3.  Therefore  (1,  0,  0)  is  an  extreme  point  of  T\  and  (0,  0,  1)  is  an 
extreme  point  of  TV  The  corresponding  nonsingular  submatrix  is  (3). 

Of  the  nine  2X2  submatrices,  the  following  three, 


exhibit  strict  dominance  and  therefore  cannot  have  all  strategies  active. 
The  matrix 

[l 

is  singular  and  hence  need  not  be  considered.  For  each  of  the  remaining 
five  matrices, 


'5  31  [4  21  [3  31  T3  51  ' 

_2  3J  L.3  3J  |_4  2J  [_3  2  J’ 


all  strategies  are  active.  Their  solutions  can  be  computed  by  inverting  the 
corresponding  matrices.  For  example,  to  solve  the  second  matrix  of  the 
five,  we  have  the  following  computations: 


If  we  test  (0,  0,  1),  (§,  0,  §)  for  a  solution  of  the  original  game  A,  we  find 

3  _  3 

max  2  ai/y*}  =  3,  min  2  al3x*i  =  2. 

i  j = 1  j  i— 1 


Therefore  (0,  0,  1),  (§,  0,  |)  is  not  a  solution  of  A.  Of  the  five  2X2 
matrices  we  find  that  the  two  matrices. 
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Gi]  ['  2) 

yield  solutions  which  are  also  solutions  of  the  game.  They  are,  respectively, 

(i,  0,  |),  (0,0,1); 

(1,0,0),  (i,0,i). 

Finally,  the  3X3  full  matrix  must  be  tested  for  a  solution.  We  have: 


=  1  -18 
JzA-'Js  -6 

-i- 

=  Y*,  v{A')~1J,3  =  0  =  X*. 

% 

_3_ 

Thus  the  3X3  matrix  yields  an  extreme  point.  But  it  is  identical  to  one 
of  the  extreme  points  previously  determined  in  a  2  X  2  submatrix.  There¬ 
fore  Ti  is  the  line  joining  (1,  0,  0)  and  (|,  0,  f).  T2  is  the  line  joining 
(0,  0,  1)  and  (|,  0,  |).  Every  optimal  strategy  of  Blue  can  be  expressed  by 

a(l,0,0)  +  (l  -«)(*,  0,§),  0<«<1. 

Every  optimal  strategy  of  Red  can  be  expressed  by 

1»(0,  0,1)  +  (1  -£)(§,  0,£),  0  </?<!. 


11.  GEOMETRY  OF  SOLUTIONS 

We  have  described  the  properties  of  each  optimal  strategy.  We  also 
have  described  how  to  construct  the  set  of  optimal  strategies,  which  is  a 
convex  set.  In  this  section  we  shall  describe  additional  geometric  relation¬ 
ships  that  exist  among  the  sets  of  optimal  strategies.  These  relationships 
can  be  used  to  check  the  completeness  of  the  sets  of  solutions. 

We  shall  need  to  introduce  some  additional  terms  and  symbols  as¬ 
sociated  with  the  set  of  solutions.  To  clarify  these  terms  and  symbols  we 
shall  apply  them  to  the  particular  game  whose  payoff  matrix  is  defined  by 

Ri  R%  R% 

£i[“-l  3  -3" 

A  =  £2  2  0  3  . 

B\_  2  1  0 
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Let  the  set  of  pure  strategies  for  Blue  be  denoted  by  I  =  {1,2,  ...  ,m} 
and  for  Red  by  J  =  {1,2, ...  ,n) .  In  our  example  I  =  {Bi,  B2,  B3}, 
J  =  {Ri,  R-i,  R-i} . 

The  set  of  mixed  strategies  for  Blue  can  be  represented  by  points 
X  =  (xi,  Xi,  ...  ,  xra)  in  m-dimensional  space  whose  coordinates  are  non- 
negative  and  whose  sum  is  1.  The  set  of  such  points  defines  a  simplex  S(I). 
Red’s  set  of  mixed  strategies  is  a  simplex  S(J).  For  the  3X3  matrix  of 
our  example,  S(I )  is  an  equilateral  triangle  with  sides  of  length  V2  (Fig.  6). 


Z?2  /?2 


The  vertices  of  the  triangle  represent  the  pure  strategies.  Similarly,  S(J) 
is  an  equilateral  triangle.  Every  point  in  the  triangle  represents  a  mixed 
strategy. 

We  let  X(A)  be  the  set  of  optimal  mixed  strategies  of  Blue— i.e.,  for 
each  X  s  X(A)  we  have 

min  X'A  Y  —  v. 

Y 

Let  Y(A)  be  the  set  of  optimal  mixed  strategies  of  Red — i.e.,  for  YzY(A) 
we  have 

max  X'A  Y  =  v. 
x 

In  our  example,  it  can  be  verified  that  Blue  has  a  unique  optimal  strategy — 
li  0).  However,  Red’s  set  of  optimal  strategies  has  two  extreme 
points — (0,  f,  |)  and  (£,  f,  }).  Thus  X(A)  is  the  point  whose  coordinates 
are  (§,  f ,  0)  and  Y (A)  is  the  line  segment,  whose  end  points  are  (0,  f ,  |) 
and  (|,|,1). 

With  each  mixed  strategy  X  of  Blue  we  can  associate  a  set  h(X)  which 
is  the  set  of  pure  strategies  that  have  positive  weights  in  X.  Thus 

h(0,  1,  0)  =  {B2}  and  h( i  f,  0)  =  {Bh  B,}. 

Let  h(Y)  be  the  set  of  pure  strategies  of  Blue  which  yield  v  when  played 
against  Y.  In  our  example,  it  can  be  verified  that  I2( 0,  f ,  |)  =  {Bh  B2} . 
In  a  similar  manner  we  may  define  for  Red  the  sets  J\(Y)  and  J2(X). 
Thus 


and 


J i(i>  i>  i)  —  Rs} 

J *d,  f ,  0)  =  {Ri,  Ri,  R3}  • 
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Effective  strategies.  It  is  evident  that  for  all  X  interior  to  X(A )  the  set 
Ii(X)  is  the  same,  so  let  us  denote  this  set  by  I\.  Also,  for  all  Y  interior  to 
1  (A)  the  set  h(Y)  is  the  same;  we  shall  denote  it  by  /2.  Further,  it  can 
be  shown  that  the  two  sets  are  identical,  or 

h  —  i  i- 

The  identical  sets  I\  and  /2  are  called  the  effective  strategies  for  Blue.  They 
have  two  important  properties :  First,  they  are  the  strategies  which  appear 
with  positive  weights  in  every  interior  solution;  secondly,  they  yield  the 
value  of  the  game  against  every  interior  optimal  mixed  strategy  of  Red. 
With  a  similar  definition  for  Red’s  strategies,  it  follows  that 

J 1  —  J  2, 

and  they  are  the  effective  strategies  for  Red. 

In  our  example,  Blue  has  a  unique  optimal  strategy  X  =  (§,  f,  0). 
Thus  Ii(X)  =  {Bi,  Zb}.  Now  the  set  Y (A)  consists  of  strategies  of  the  form 

Y  _  fl  a  2  |  3  ,,  .  oc  .1  .  .”1 

F~L  5  ’3«  +  5(1  -  “)>3  +  5(1  -  «)J> 

where  0  <  a  <  I.  Now  for  each  Y  interior  to  F(A)  we  obtain  the  value 
of  the  game,  1,  for  strategies  B1  and  «2.  Thus  h{Y)  =  {Bh  B2}  =  h  =  h. 
Similarly  J1(Y)  =  {Rh  R2,  R3}  =  J \  and  J2{X)  =  {Rh  R2,  R3}  ==  J2  =  J,. 
Two  strategies,  Bi  and  _S2,  are  effective  for  Blue  and  all  three  strategies  are 
effective  for  Red. 


Essential  subgame.  If  we  construct  a  payoff  matrix  A\  having  the  set 
1 1  for  its  rows  and  J i  for  its  columns,  we  obtain  the  essential  submatrix  of  A. 
I  he  value  of  the  essential  subgame  is  the  same  as  the  value  of  the  game  A . 
Further,  X(A)  will  be  a  subset  of  X(Ai)  and  Y (A)  will  be  a  subset  of  Y (Ai). 

The  essential  submatrix  in  our  example  is  determined  by  I\  =  {B1}  B2} 
and  J i  =  (R1;  Rh  R3},  or 


A\ 


Ri  Ri  R ;? 

Rif-1  3  -31 
B2  L  2  0  3  J 


Solving  Ai  we  get  X(Ai)  =  (f,  f)  and  F(Ai)  is  the  line  connecting  (0,  f,  |) 
and  (J,  },  0).  The  value  of  the  game  A\  is  1.  Now  F (A),  which  is  the  line 
with  end  points  (0,  f,  1)  and  Q,  f,  |),  is  clearly  a  subset  of  F(Ai). 

Dimension  of  solutions.  The  sets  X{A)  and  F(A)  are  convex  polyhedra. 
Let  S(If)  be  the  simplex  of  the  effective  strategies  of  Blue.  Then  S(Ji)  is 
the  smallest  closed  simplicial  face  of  S(I )  that  contains  X(A).  Similarly, 
S(Ji)  is  the  smallest  closed  simplicial  face  of  S (./)  that  contains  F(A). 

Since  7i  =  {Bi,  B2}  in  our  example,  S(I i)  is  the  line  joining  Bi  and  B2 
of  S(I).  Since  Ji  =  {R\,  R2.  R?) ,  then  S(J j)  =  S (J) .  The  simplex  S(If) 
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has  dimension  1  and  S(Ji)  has  dimension  2.  Since  S(A)  consists  of  one 
point,  it  is  therefore  of  dimension  0.  The  dimension  of  Y (A)  is  1.  In  general, 
the  dimensions  of  these  various  sets  are  related  as  follows: 

dim  S(h)  —  dim  X(A)  =  dim  S(Ji )  —  dim  Y(A ) 

_  f  rank  Ai  if  v  =  0 
—  \  rank  Ai  —  1  if  v  t6  0. 

In  our  example,  we  have  rank  Ai  =  2,  v  —  1,  and  dim  T(A)  =  1. 
Substituting  in  the  above  we  have 

1— 0  =  2—  1  =  2—  1. 

12.  TARGET  SELECTION— FOR  ATTACK  AND  DEFENSE 

The  target  selection  problem  frequently  appears  in  military  situations. 
In  its  most  general  form  the  problem  may  be  described  as  follows:  Sup¬ 
pose  the  Attack  and  Defense  each  have  a  fixed  quantity  of  resources  to 
allocate  among  a  series  of  targets  of  different  values.  How  should  the 
Attack  and  Defense  select  the  targets  for  the  allocations? 

Let  us  examine  the  simplest  of  the  target  selection  problems.  The  At¬ 
tack  has  one  unit  and  the  Defense  has  one  unit;  each  unit  is  to  be  allocated 
to  some  target.  Which  target  should  receive  the  allocation.'’ 

The  game  may  be  described  as  follows: 

Targets.  There  are  n  targets  which  we  label  Ti,  Tt, .  .  . ,  Tn.  We  assume 
that  these  targets  have  values  Oi,  a^, ... ,  a„,  respectively,  which  are 
ordered  as  follows: 

^  c&2  ^  ^  an  ^  0. 

Attack.  The  Attack,  Blue,  has  one  attacking  unit  to  allocate  to  some 
one  of  the  n  targets. 

Defense.  The  Defense,  Red,  has  one  unit  of  defense  to  allocate  to  some 
one  of  his  targets.  It  is  assumed  that  the  unit  of  defense  has  a  defense 
potential  p — i.e.,  if  an  attack  is  made  on  a  defended  target,  then  with 
probability  p,  the  Attack  fails  to  destroy  the  target.  Hence  1  -  p  is  the 
probability  that  Blue  is  successful  in  destroying  this  attacked  target. 

Strategy.  A  strategy  for  Blue  is  a  choice  of  a  target  for  attack.  A  strat¬ 
egy  for  Red  is  a  choice  of  a  target  for  defense.  Hence  each  player  has  n 
strategies. 

Payoff.  We  shall  assume  that  if  an  attack  is  made  on  an  undefended 
target,  Tk,  then  the  payoff  to  Blue  is  the  value,  a*,,  of  that  target.  How¬ 
ever,  if  an  attack  is  made  on  a  defended  target,  Ti,  then  the  payoff  is 
(1  —  p)a,,  the  expected  damage  to  the  target.  Therefore,  the  payoff  matrix 
is  the  following: 
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Target  Selection  Payoff 
Target  defended 


Tx 

t2 

Tn 

Ti 

"(1 

~  P)ai 

Oi 

dl 

(1  -  p)a2 

...  CL% 

Tn 

&n 

On 

.  .  .  (1  —  p)an_ 

T arget 
attached 


Because  of  the  special  form  of  the  payoff  matrix,  we  shall  solve  this 
game  by  guessing  a  solution  and  then  verifying  it.  However,  to  make  a 
good  guess  it  is  necessary  to  make  some  preliminary  investigations. 

It  is  reasonable  to  expect  the  solution  to  have  the  property  that  each 
player  randomizes  over  the  same  high-valued  targets.  That  is,  let  us 
assume  that  X *,  F*  is  of  the  following  form : 

X*  =  (xh  x2,  .  .  . ,  xt,  0,  0,  0), 

=  (V h  2/2,  ,  yt,  0,  0,  0), 

where 

Xi  >0,  i  <  t; 

Xi  =  0,  i  >  t; 

Vi  >0,  j  <  t; 

Vi  =  0,  j  >  t; 

where  the  value  of  t  is  to  be  determined. 

Since  yi  >  0,  y2  >  0,  it  follows  that 

(1  p)a  1X1  +  a2Xi  +  a,tXt  =  v, 

o-iXi  +  (1  —  p)a2x 2  +  .  .  .  +  atxt  =  v. 

Subtracting  the  first  equation  from  the  second,  we  get 

paxxx  —  pa-iXi  =  0, 


or 


In  general,  we  have 
However, 


x2  = 


Xi 


i  =  1 


aixx 

Q>2 

CLlXi 

Cli 


for  i  <  t. 


t 

—  V 

aiXi 

t  =  l 

a. 

k  i 

2  ~ 
i= i  a% 

=  1. 


Let  us  define 
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We  get 


Therefore  X*  must  be  such  that 


for  i  <  t, 


=  0  for  i  >  t. 

To  obtain  the  form  of  Y*  we  note  that  Xi  >  0  and  £2  >  0.  Therefore 
(1  —  p)eui/i  +  aty2  +  ...  +  axyt  =  v, 
a2yi  +  (1  —  y)a2y2  +  -  ■  •  +  a2yt  =  v. 

From  this  pair  of  equations  and  since  Y*  is  a  mixed  strategy,  we  get 


-  [1  -  -  (l  -  pyi)\ 

V  L  a2  J 


Using  the  fact  that  X\  >  0  and  X;  >  0  for  i  <  t  we  obtain,  by  a  similar 
argument, 


Mi  =  - 1(1  -  Wi)]  for  j  <  t. 


s  ys  =  1. 

;  =  1 


This  yields  F*  of  the  form 


for  j  <  t, 


=  0  for  j  >  t. 

We  also  obtain  a  guess  of  the  value  of  the  game,  since  we  must  have 
v  =  (1  —  p)aiXi  +  02^2  +  .  .  .  +  dtXt 

t 

—  S  ciiXi  —  paiXi 

1  =  1 


Verification.  We  shall  now  prove,  by  verifying,  that 
(i)  The  optimal  strategy  for  Attack  is 


for  i  <  t, 


for  i  >  t. 
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(ii)  The  optimal  strategy  for  Defense  is 


Vi 


V  V  a,-  At  ) 


=  0 

(iii)  The  value  of  the  game  is 


for  j  <  t, 
for  j  >  t. 


t  —  p  k  —  v 

v  =  — —  =  max  — t-1-- 

Note  that  the  value  of  t  is  defined  by  the  location  of  the  maximum 
of  (k  -  p)/Ak. 

Let  us  verify  that  the  strategy  above  is  optimal  for  the  Attack.  Clearly 

n 

Xi  —  0  and  S  *<=1.  Hence  (i)  defines  mixed  strategy.  Computing 
expectations,  we  have 

n  n 

2  anXi  =  2  ciiXi  +  a,j(l  —  p)xj 

i=»l  i  =  l 


2  di  .  +  dj(  1 

diAt 

2  di~- 

diAt 

for  j  >  «, 

r  l=jr 

At 

for  j  <  t, 

-  At 

for  j  >  t. 

2  al7*i  > 

i=  1  Aj 

for  all  j. 

Therefore, 


Now  let  us  verify  that  (ii)  is  optimal  for  Defense.  Clearly  2  y,  =  1. 

y=i 

I  hat  yj  >  0  will  be  verified  later.  Now  compute  expectations 

n 

2  ciijyj  =  2  o-iyj  +  (1  —  p)diyi 

3  =  1  i  5*y  <n 

=  “i  2  Vi  +  (1  -  v)aiVi 

i  **3  <n 

=  di(l  -  py{) 

t  ~  V 
At 


if  i  <  t, 
if  i  >  t. 
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This  implies  that  if  Y*,  as  defined  above,  is  a  solution,  we  need 

di  <  for  *  >  L 

~  At 

In  particular, 

^  t  -  V 

a(+1  <  -jf’ 

or  p  <  t  —  at+iAt  =  t  +  1  —  a(+iA(+i. 

Also,  since  we  require  yj  >  0,  we  get  p  >  t  —  atAt. 

The  preceding  leads  us  to  study  the  function 

<j>k  —  k  dkAk- 

Let  us  compute 

4>k+i  —  =  k  1  —  cik+iAk+i  k  T-  a, kAk 

=  1  —  (iti-iAk+i  +  dkAk 

>  1  —  dk+i  (An  +  )  +  ak+iAk  =  0. 

\  a,k+ 1/ 

Thus  0  is  a  strictly  increasing  function  of  k.  Since  <f>\  =  0,  and  if  we  define 
0„+i  =  oo ,  then  it  follows  that  there  exists  exactly  one  positive  integer 
t  <  n  such  that 

<t>t  <P  <  4>t+ 1. 

For  this  integer  t,  we  have 

t  —  dtAt  <  p  <  t  +  1  —  ctf+iAi+i. 


This  gives  us  the  inequality 

.  t  ' 

dt+ 1  <  ^  S  at- 

This  inequality  combined  with 


yields 


ai  >  o2  >  .  .  .  >  dn 


for  i  <  t, 


>  df  for  i  >  t. 

At 

To  complete  the  verification  we  need  to  show  that  y,  >  0  for  j  <  t. 
This  is  exactly  the  statement 

<  a,  for  j  <  t 

At  ~ 

which  has  been  shown  above. 

Recall  that  the  value  of  t  was  defined  by  the  inequality 


4>t<P< 
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We  shall  now  show  that  t  is  also  given  by 


The  function 


max 

k<n 


k~p  =  *  -  v 

Ak  At 


m  = 


fc  —  v 

Ah 


has  a  maximum,  say  at  k  —  u.  Hence  we  have 

/(«  +  1)  <  /O), 

/(«  -  1)  <  /(«)• 

Evaluating  the  function  at  u,  u  —  1,  and  u  +  1,  these  inequalities  yield 


u  —  auAu  <  p  <  u  +  1  —  au+iAu+\. 

This  is  exactly  the  inequality  satisfied  by  the  integer  t.  Hence  u  —  t. 


13.  SOLUTION  OF  THE  GAME  “LE  HER” 

This  game  was  described  in  Example  3,  Chapter  1.  If  we  define  the 
payoff  to  be  1  if  the  dealer  wins,  and  0  if  the  receiver  wins,  then  we  can 
compute  the  payoff  matrix  associated  with  the  213  X  213  ways  of  playing 
“le  Her.”  For  example,  if  the  strategy  selected  by  the  receiver  is  to  change 
any  card  which  is  6  or  under  and  to  hold  any  card  which  is  7  or  over,  and 
if  the  strategy  selected  by  the  dealer  is  to  change  any  card  which  is  8  or 
under  and  to  hold  any  card  which  is  9  or  over,  then  the  payoff  corresponding 
to  this  way  of  playing  the  game  is  obtained  from  the  matrix 


Receiver 


C 

C 

C 

C 

C 

C 

S 

S 

S 

S 

S 

S 

s 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

c 

1 

"50 

50 

50 

50 

50 

50 

23 

19 

15 

11 

7 

3 

0" 

c 

2 

43 

50 

50 

50 

50 

50 

23 

19 

15 

11 

7 

3 

0 

c 

3 

39 

39 

50 

50 

50 

50 

23 

19 

15 

11 

7 

3 

0 

c 

4 

35 

35 

35 

50 

50 

50 

23 

19 

15 

11 

7 

3 

0 

c 

5 

31 

31 

31 

31 

50 

50 

23 

19 

15 

11 

7 

3 

0 

c 

6 

27 

27 

27 

27 

27 

50 

23 

19 

15 

11 

7 

3 

0 

Dealer  C 

7 

23 

23 

23 

23 

23 

23 

26 

19 

15 

11 

7 

3 

0 

C 

8 

19 

19 

19 

19 

19 

19 

26 

22 

15 

11 

7 

3 

0 

S 

9 

15 

15 

15 

15 

15 

15 

50 

50 

50 

0 

0 

0 

0 

s 

10 

11 

11 

11 

11 

11 

11 

50 

50 

50 

50 

0 

0 

0 

s 

11 

7 

7 

7 

7 

7 

7 

50 

50 

50 

50 

50 

0 

0 

s 

12 

3 

3 

3 

3 

3 

3 

50 

50 

50 

50 

50 

50 

0 

s 

13 

_50 

50 

50 

50 

50 

50 

50 

50 

50 

50 

50 

50 

50_ 

where  the  elements  in  the  matrix  represent  50  times  the  probability  of  the 
dealer’s  winning  for  each  of  the  possible  ways  that  the  cards  may  be  dealt. 
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Now  the  probability  that  the  particular  pairs  of  cards  will  be  dealt  is 
given  by 

-h  '  A  =  drr  f°r  nondiagonal  elements, 


■h  '  -ix  ~  rtp  f°r  diagonal  elements. 

Therefore  we  need  to  multiply  diagonal  elements  of  the  matrix  above  by 
3/ (663)  (50)  and  nondiagonal  elements  by  4/ (663)  (50)  and  add  the  169 
results,  which  gives  16,146/  (663)  (50),  the  payoff  for  this  pair  of  strategies. 

It  is  not  necessary  to  compute  the  payoff  for  the  entire  21S  X  213 
matrix,  for  it  is  apparent  that  any  strategy  in  which  exactly  r  changes  of 
cards  appear  is  dominated  by  a  strategy  in  which  the  first  r  cards  are 
changes  and  the  remaining  13  —  r  are  stays.  This  follows  from  a  dominance 
argument.  It  is  also  intuitively  evident,  because  if,  for  example,  it  is  ad¬ 
visable  to  stay  with  a  3  and  to  change  a  4,  it  is  certainly  more  advisable 
to  change  a  3.  This  reduces  the  matrix  to  14  X  14.  Finally,  the  14  X  14 
matrix  can  be  reduced  to  a  2  X  2  matrix,  again  by  a  dominance  argument. 
The  resulting  2X2  matrix  is  the  following: 


Dealer 


Hold  8 
and  over 

Change  8 
and  under 


Deceiver 


Hold  7 

Change  7 

and  over 

and  under 

16,182 

16,122 

33,150 

33,150 

16,146 

16,182 

33,150 

33,150 

Solving  the  2X2  game,  we  obtain  the  optimal  strategies 


for  the  Dealer: 


for  the  Receiver: 
and  the  value  of  the  game  is 


rs' 

8 

3 

8. 


for  the  Dealer:  0.487, 

for  the  Receiver:  0.513. 


14.  SOLUTION  OF  THE  GAME  OF  “MORRA” 

This  game  was  described  in  Example  2,  Chapter  1.  Since  the  game  is 
symmetric,  the  two  players  will  have  the  same  strategies.  The  solution  can 
be  obtained  by  examining  square  submatrices  and  using  dominance  argu¬ 
ments.  There  are  four  extreme  points  and  they  are  close  to  one  another: 
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(0,  o,  a,  o,  A,  0,  A,  0,  o), 

(0,  0,  %Y,  0,  tj4,  0,  -gY;  0,  0), 

(0,  0,  ff,  0,  if,  0,  if,  0,  0), 

(0,  o,  If,  0,  n,  0,  a,  o,  0). 

It  is  also  interesting  to  note  that  the  essential  submatrix  has  rank  zero ;  it 
is  the  matrix 

"0  0  0" 

0  0  0. 

_°  0  °_ 

15.  RECONNAISSANCE  AS  A  GAME  OF  STRATEGY 

The  advisability  of  reconnaissance  before  attack  can  be  investigated  by 
considering  the  problem  as  a  game  of  strategy.  The  purpose  of  reconnais¬ 
sance  is  to  obtain  information  about  the  enemy’s  strategic  intentions. 
Having  the  information,  the  attacker  can  more  effectively  plan  his  attack. 
However,  the  attempt  to  obtain  information  is  costly  and  it  may  not  suc¬ 
ceed  if  the  enemy  invokes  effective  countermeasures.  This  conflict  of  in¬ 
terest,  whether  or  not  to  reconnoiter,  can  be  resolved  within  the  framework 
of  a  game  of  strategy. 

To  illustrate  the  essential  principles,  we  shall  solve  a  hypothetical  re¬ 
connaissance  problem  associated  with  a  small  number  of  strategies  for  each 
side.  No  additional  principles  are  introduced  if  the  players  have  a  large 
number  of  strategies.  Thus  we  shall  assume  that  the  attacker  and  defender 
have  two  strategies  each. 

Let  us  assume  that  the  attacker,  Blue,  wishes  to  seize  a  defended  enemy 
position.  For  simplicity,  let  us  assume  that  he  has  two  courses  of  action, 
his  two  strategies: 

©  attack  with  his  entire  force; 

©  attack  with  part  of  his  force,  leaving  the  remainder  as  reserves  and 
a  rear  guard  in  case  the  enemy  “outflanks”  him. 

The  defender,  Red,  is  assumed  to  have  two  possible  courses  of  action, 
his  two  strategies: 

[T]  defend  with  his  entire  force  the  objective  of  the  attacker; 

pil  defend  with  part  of  his  force,  and  send  the  remainder  to  “outflank” 
the  enemy  and  attack  the  enemy  from  the  rear. 

There  are  four  possible  outcomes  of  the  above  courses  of  action.  They 
can  be  summarized  by  the  following  2X2  matrix: 

0  0 
©ran  «„-] 

©  La2i  a22j 
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where,  for  example,  a2 1  represents  the  value  to  Blue  if  he  attacks  with  part 
of  his  force  and  Red  defends  with  his  entire  force. 

Suppose  that  the  outcomes  are  such  that  if  Red  uses  strategy  GO,  then 
Blue  would  prefer  to  use  strategy  ®,  i.e.,  an  >  a2 1,  and  if  Red  uses  strategy 
[¥],  then  Blue  would  prefer  to  use  strategy  ©.  Clearly,  the  attacker  could 
benefit  from  a  knowledge  of  the  defender’s  intentions.  Thus,  the  attacker 
might  find  it  profitable  to  send  out  a  detachment  of  men  to  reconnoiter  in 
an  attempt  to  discover  the  plans  of  the  defender.  In  order  to  defend  him¬ 
self  against  such  possible  action  the  defender  may  take  countermeasures. 

Now  if  the  attacker  decides  to  reconnoiter  he  must  sacrifice  some  of  his 
attacking  forces.  If  the  defender  decides  to  take  countermeasures  he  must 
sacrifice  some  of  his  defensive  forces.  Let  c  be  the  cost  to  the  attacker  in 
in  order  to  reconnoiter  and  d,  be  the  cost  to  the  defender  in  order  to  use 
countermeasures. 

The  decisions  of  the  attacker,  whether  or  not  to  reconnoiter,  and  of  the 
defender,  whether  or  not  to  use  countermeasures,  have  increased  the  num¬ 
ber  of  strategies  available  to  the  two  sides.  Whereas  the  original  game  had 
only  two  strategies  for  each  side,  the  resulting  reconnaissance  game  will  be 
seen  to  have  16  strategies  for  the  attacker  and  4  for  the  defender.  Of 
course,  many  of  the  strategies  will  turn  out  to  be  poor  strategies.  But  in 
establishing  the  game  we  must  enumerate  all  of  them. 

A  strategy  for  the  attacker  will  be  a  set  of  instructions  which  tell  him 
how  to  act  taking  into  account  the  information  he  may  receive.  A  con¬ 
venient  way  to  represent  symbolically  a  strategy  for  the  attacker  is  by  an 
ordered  sequence  of  numbers  ( u ;  x,  y,  z )  in  which  each  letter  takes  on  the 
value  1  or  2,  and  they  have  the  following  meanings: 

u  —  1.  Reconnoiter. 

u  =  2.  Do  not  reconnoiter. 

x  =  1.  Play  strategy  ©  if  no  information  is  received  about  the 
defender. 

x  =  2.  Play  strategy  ©  if  no  information  is  received  about  the 
defender. 

y  =  1.  Play  strategy  ®  if  the  information  indicates  that  the  defender 
is  using  strategy  [T]. 

y  =  2.  Play  strategy  ©  if  the  information  indicates  that  the  defender 
is  using  strategy  0. 

2=1.  Play  strategy  ®  if  the  information  indicates  that  the  defender 
is  using  strategy  ©. 

2=2.  Play  strategy  ©  if  the  information  indicates  that  the  defender 
is  using  strategy  ©. 

For  example,  the  strategy  (1;  1,  2,  1)  instructs  the  attacker  to  recon¬ 
noiter,  and,  if  no  information  is  obtained,  to  play  strategy  ® — attack  with 
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all  his  forces;  if  the  information  indicates  that  the  enemy  is  using  strategy 
[T|  play  strategy  ©;  if  the  information  indicates  that  the  enemy  is  using 
strategy  @  play  strategy  ®.  There  will  be  24  =  16  different  strategies  for 
the  attacker.  Some  of  them  are  redundant,  and  they  will  show  up  as  such 
in  the  payoff  matrix. 

A  strategy  for  the  defender  is  an  ordered  sequence  (s;  t)  where  each 
letter  takes  on  the  value  1  or  2  with  the  following  meanings: 

s  =  1.  Take  countermeasures. 

s  =  2.  Take  no  countermeasures. 

t  —  1.  Use  strategy  ®. 

t  =2.  Use  strategy  00 . 

Thus  (1 ;  2)  means  that  Red  takes  countermeasures  and  uses  strategy 
©,  i.e.,  defends  with  a  partial  force. 

With  this  notation  for  strategies,  the  64  possible  outcomes  of  the  game 
can  be  summarized  by  the  payoff  matrix  Ai  given  by: 

Reconnaissance  Patoff 


(i;  i) 

(i;  2) 

(2;  1) 

(2;  2) 

(i;b  i,  i) 

ail  —  c  +  d 

Gl2  —  C  +  d 

Gil  —  C 

Oi2  —  C 

(l;  l,  i,  2) 

Ou  —  c  +  d 

dl2  —  c  d 

Ou  C 

a22  —  c 

d;i,  2,  i) 

On  —  c  +  d 

dn  —  c  +  d 

a2i  -  c 

O12  —  c 

d;i,  2, 2) 

ou  —  c  +  d 

G12  —  c  +  d 

a2 1  —  c 

O22  —  C 

(l;  2,  l,  l) 

021  —  c  +  d 

CL-n  —  C  -f*  d 

Oil  -  C 

Oi2  —  c 

(i;  2,  l,  2) 

O21  —  c  +  d 

On  —  c  +  d 

On  —  c 

O22  —  C 

(i;  2, 2,  l) 

O21  —  c  -f-  d 

(X22  —  c  +  d 

021  —  c 

O12  —  c 

(1;  2,  2,  2) 

0 2t  —  c  d 

a22  —  c  +  d 

O21  —  c 

O22  —  c 

(2;  l,  l,  l) 

on  +  d 

an  "I-  d 

Oil 

012 

(2;  1,  1,  2) 

On  +  d 

an  d 

Oil 

022 

(2;  1,  2,  1) 

On  +  d 

an  +  d 

021 

012 

(2;  1,  2,  2) 

an  +  d 

G12  +  d 

021 

022 

(2;  2, 1, 1) 

O21  +  d 

G22  ~h  d 

On 

012 

(2;  2, 1,  2) 

021  d*  d 

G22  ~f"  d 

Oil 

022 

(2;  2,  2,  1) 

021  d~  d 

O22  +  d 

021 

021 

(2;  2,  2,  2) 

_ On  +  d 

G22  +  d 

o2l 

O22  _ 

For  example,  if  Blue  uses  strategy  (1;  1,  1,  2)  and  Red  uses  strategy 
(1;  2)  the  payoff  an  —  c  +  d  is  computed  as  follows:  Blue  reconnoiters  at 
a  cost  c  but  receives  no  information  about  the  defender,  and  so  he  selects 
strategy  ®.  Red  takes  countermeasures  at  a  cost  d  to  him  or  a  gain  of  d 
by  Blue,  and  uses  strategy  Uj.  Therefore  the  payoff  to  Blue  is  an  +  d  —  c. 

This  sixteen-rowed  matrix  may  be  reduced  to  a  four-rowed  matrix  by 
testing  for  dominance.  Every  odd  ^ow  is  dominated  by  the  next  even  row, 
which  eliminates  eight  rows.  Among  the  even  rows  every  (2k  +  2)th  row 
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is  dominated  by  the  (2ft)th  row  where  k  =  1,  3,  5,  7.  This  eliminates  four 
more  rows,  leaving  the  following  4X4  matrix: 


(i;  i,  i,2) 

t>  =  (i;  2,  l,  2) 

(2;  l,  l,  2) 

(2;  2,  1,  2) 


(1,1) 

on  —  c  +  d 
On  —  c  -J-  d 
On  +  d 
On  +  d 


(1,2) 

On  —  c  +  d 
022  —  c  ~{-  d 
On  ~f~  d 
O22  -}-  d 


(2,1) 
an  -  c 
On  —  c 
Oil 
an 


(2,  2)  ^ 
0-22  —  C 
0>22  —  C 
Cl22 
&22 


It  is  evident  from  the  matrix  B  that  the  decision  by  the  attacker  of 
whether  or  not  to  reconnoiter  and  by  the  defender  of  whether  or  not  to 
invoke  countermeasures  will  depend  on  the  costs  c  and  d.  A  solution  of 
game  B,  i.e.,  the  computation  of  a  pair  of  optimum  strategies,  will  indicate 
the  advisability  of  reconnoitering  and  the  use  of  countermeasures. 

The  reconnaissance  problem  we  have  described  is  relatively  simple.  It 
can  be  expanded  to  include  the  situation  where  the  attacker  receives  par¬ 
tial  information  about  his  enemy,  or  where  the  attacker  uses  several  types 
of  reconnaissance.  The  same  approach  can  be  used  to  compare  different 
types  of  reconnaissance  in  order  to  select  the  best  type. 

Returning  to  the  original  problem,  let  us  solve  a  particular  reconnais¬ 
sance  game.  Suppose  the  matrix  A  is  given  by 


0  0 
48  24] 
12  36  J’ 


and  let  c  =  9,  d  =  7.  Then  the  matrix  B  becomes 


(1,1) 

(1,2) 

(2,  1) 

(2,  2) 

(i;  1, 1,2) 

"  46 

22 

39 

27 

(i;  2, 1, 2) 

10 

34 

39 

27 

(2;  1,  l,  2) 

55 

31 

48 

24 

(2;  2, 1, 2) 

19 

43 

12 

36 

This  game  has  no  saddle-point.  Since  no  row  (column)  dominates  any 
other  row  (column),  it  is  necessary  to  solve  a  4  X  4  game.  This  can  be 
done  in  several  ways.  However,  the  reader  can  verify  that  the  following 
are  solutions  of  the  game : 

(a)  For  Blue,  the  attacker, 


x'i  —  (A,  A,  0,  A), 

—  ( 17  7  n  i2^ 

x2  —  V1T6  7  T7T>  ^67  > 


—  (C\  28  5 1  _6 

X3  \Uf  144  >  14  4?  14 


-  /28  n  3  17 

x*  —  Vi§ J  u>  ¥¥>  TS 
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(b)  For  Red,  the  defender, 

Y'  =  (0,  0,  i  f). 

The  value  of  the  game  is  30. 

Thus  the  defender  never  takes  countermeasures  and  defends  |  of  the 
time  with  his  entire  force  and  f  of  the  time  with  part  of  his  force.  The 
attacker  has  many  optimal  possibilities  open  to  him,  given  by  his  large 
number  of  optimal  strategies.  Among  his  possibilities  is  the  following:  rec- 
onnoiter  with  probability  §  and  not  reconnoiter  with  probability  §.  If  he 
does  reconnoiter  and  receives  no  information,  then  with  probability  f  he 
uses  strategy  ©  and  with  probability  |  he  uses  strategy  ©;  if  he  does  recon¬ 
noiter  and  receives  information  that  Red  is  using  strategy  0,  then  Blue 
selects  strategy  ®  with  probability  §  and  strategy  ©  with  probability  §, 
and  similarly  for  strategy  00.  If  Blue  does  not  reconnoiter,  he  always  uses 
strategy  ©,  or  leaves  a  rear  guard. 


16.  APPLICATION  OF  STRUCTURE  THEOREMS  TO  RECONNAISSANCE 

In  the  particular  reconnaissance  game  solved  in  the  previous  section, 
we  listed  four  basic  solutions  for  Blue  and  one  for  Red.  Every  linear  com¬ 
bination  of  these  four  basic  solutions  will  be  a  solution  of  the  game  for  Blue. 
We  wish  to  verify  that  there  are  no  other  solutions. 

Since  the  matrix  B  of  the  reconnaissance  game  is  4  X  4  we  can  repre¬ 
sent  each  mixed  strategy  for  player  I  by  a  point  in  a  tetrahedron,  S(I). 
The  four  basic  solutions  Xh  X2,  X:l,  and  Xt  can  be  shown  to  lie  on  a  plane. 
Taking  every  linear  combination  of  these  four  basic  solutions  we  obtain 
X(B),  the  set  of  all  solutions.  X{B)  will  be  a  plane  in  the  tetrahedron. 


Figure  7 


In  the  diagram  (Fig.  7),  S(I)  is  the  tetrahedron  whose  vertices  are  A, 
B,  C,  D.  The  four  triangles  are  the  four  faces  of  the  tetrahedron.  X(B)  is 
the  quadrilateral  whose  vertices  are  Xi,  X2,  Xt,  Xt.  The  point  A,  for 
example,  is  defined  by  X\  —  1  and  therefore  is  written  as  A(Xi  —  1).  Now 
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the  smallest  closed  simplicial  face  which  contains  X{B)  is  the  tetrahedron, 
or 

<S(/j)  =  <S(7)  and  dim  s(h)  —  dim  S(I)  =  3. 

We  can  represent  Red’s  strategies  geometrically  (Fig.  8).  S(J)  will  be 
a  tetrahedron.  Since  Red  has  a  unique  strategy  Y  =  (0,  0,  J,  f),  the  set 
Y (B)  is  a  point  F  on  the  line  D'C' . 


A'(yr-\) 


Figure  8 

The  smallest  closed  simplicial  face  containing  Y (B)  is  the  line  D'C',  or 
dim  S(Ji)  =  1.  Since  Y  is  unique  and  its  first  two  components  are  zero, 
it  follows  that  the  essential  submatrix  Bi  consists  of  four  rows  and  the 
last  two  columns.  The  rank  of  this  matrix  is  2.  Substituting  in  the 
dimensionality  relationship, 

dim  S(h)  -  dim  X(B)  =  dim  S(Ji)  -  dim  Y(B) 

—  rank  Bi  —  1, 

we  find  that  the  relationship  is  satisfied,  for 

3  —  2+1  —  0  =  2  —  1. 

Now 

K[  =  X[B  =  (34,  30,  30,  30), 

K.2  =  X’2B  =  (30,  31i  30,  30), 

Ki  =  XiB  =  (30,  37,  30,  30), 

K’i  =  X'iB  =  (37,  30,  30,  30), 

H'  =  BY  =  (30,  30,  30,  30). 

By  definition, 

h  =  {1,2,  3,  4), 

Ji=  {3,4}. 

An  examination  of  the  above  K" s  and  H'  shows  that 

=  {3,4}, 
h  =  {1,2,  3,  4}, 


and  so 
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h  =  h, 

Jl  —  J 1. 

If  it  were  the  case  that  dim  X(B)  <  2,  the  above  verification  would  be 
sufficient  to  insure  that  X(B)  and  Y(B)  are  the  complete  sets  of  optimal 
strategies.  However,  dim  X(B)  =  2  in  this  example,  and  it  is  necessary  to 
account  for  each  face  in  X(B).  Now  each  face  can  be  accounted  for  in 
either  of  the  following  two  ways:  (1)  the  element  X;  may  be  zero  in  the 
two  extreme  points  which  span  the  face;  (2)  the  same  component  fc,-,  in 
the  two  K’s  associated  with  the  two  extreme  points  which  span  the  face, 
is  equal  to  v  and  does  not  belong  to  J 2.  Consider  the  reproduction  of 
X{B)  (Fig.  9). 


Xz  I  X, 

Figure  9 


(If  1  0,  -jg,  hff)  {  X 3  —  (0,  Yi8S>  T3TI>  rA) 

(37,  30,  30,  30)  {  =  (30,  37,  30,  30) 

(t5I  1V1  0,  xfc)  (  X2  =  (-g-g-,  0,  -g!) 

(34,  30,  30,  30)  {  K'2  =  (30,  311,  30,  30) 

All  of  the  faces  except  the  face  numbered  3  are  accounted  for  by  this 
process.  For  example,  face  number  1  is  accounted  for  by  the  fact  that 
Xz  =  0  in  both  X[  and  X£.  The  fact  that  face  number  3  is  not  accounted  for 
means  there  exists  at  least  one  more  extreme  point  between  these  two.  To 
find  this  extreme  point  glance  at  the  sketch  of  X(B)  and  its  relation  to 
iS(jT).  It  is  reasonable  to  believe  that  there  exists  only  one  additional  ex¬ 
treme  point  and  that  it  occurs  at  the  intersection  of  the  plane  defined  by 
X(B)  and  the  line  DC  of  the  tetrahedron.  One  of  the  reasons  for  so  believing 
is  that  the  existence  of  such  an  extreme  point  would  not  disturb  the  previous 
satisfaction  of  the  first  two  conditions.  Such  a  point  then,  if  it  exists, 
must  be  of  the  form 
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X'5  =  aD  +  (1  —  a)C,  0  <  a  <  1, 

where  D  =  (0,  0,  0,  1), 

C  =  (0,  0,  1,  0). 

The  unknown  a  can  be  found  by  observing  that,  since  Ji  =  {3,  4},  the 
third  and  fourth  components  of  X’5B  are  v  and  one  finds 

a  =  \. 

Hence  X'5  -  (0,  0,  §,  §). 

The  two  new  faces  created  by  the  discovery  of  X's  are  readily  explained. 

The  last  extreme  point  has  the  property  that  if  Blue  uses  this  as  an 
optimal  strategy,  it  is  uniformly  better  than  any  other  strategy  in  X  (B) 
in  that  it  guarantees  Blue  a  higher  expectation  than  any  other  strategy 
in  X{B)  if  Red  deviates  from  his  optimal  strategy. 


17.  ATTACK  ON  HIDDEN-OBJECT 

An  interesting  application  of  games  of  strategy  is  to  the  general  prob¬ 
lem  of  destroying  a  hidden-object.  A  defender  has  an  object  of  great  value 
which  he  may  conceal  in  any  one  of  many  containers.  The  attacker  makes 
a  series  of  attempts  to  destroy  the  object  by  destroying  the  containers. 
Which  container  shall  the  defender  use  to  hide  the  object  and  which 
containers  shall  the  attacker  attempt  to  destroy? 

An  example  of  this  problem  is  the  following  tactical  situation :  A  bomb 
is  to  be  carried  in  one  of  two  identical  bombers,  called  P  (protected)  and 
F  (flank).  The  bombers  fly  in  formation,  say  one  behind  the  other,  so  that 
a  hostile  fighter,  wishing  to  attack  the  leader,  must  pass  through  the  field 
of  fire  of  the  trailer  and  run  the  risk,  a,  of  being  shot  down  before  being  able 
to  close  in  on  its  target.  Once  it  has  engaged  its  target,  however,  the 
fighter  can  destroy  it  with  probability  /?.  Since  each  attacking  fighter  has 
an  opportunity  for  just  one  pass,  its  ultimate  survival  is  of  no  concern.  The 
fate  of  the  bomb  is  supposed  to  be  more  important,  disproportionately, 
than  the  fates  of  any  of  the  aircraft. 

The  strategies  of  the  two  players  are  easily  described.  The  defender, 
Red,  chooses  P  or  F,  where  he  locates  the  valuable  object.  The  attacker, 
Blue,  chooses  an  ordered  set  of  n  members,  such  as 

F  P  F  F  P 

representing  his  target  choice  on  each  pass,  provided  the  previous  passes 
have  all  failed.  It  is  agreed  that  if  one  attack  has  succeeded,  the  subse¬ 
quent  tries  are  to  be  directed  against  the  other  container  either  until  it, 
too,  has  been  destroyed  or  until  Blue  has  used  up  his  allotment  of  n  tries. 
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If  F  is  intact,  it  is  harder  to  destroy  P  than  if  F  is  gone.  If  F  is  de¬ 
stroyed,  the  probability  of  destroying  P  is  p.  If  F  is  intact,  the  probability 
of  destroying  P  is  (1  —  a)p  —  y. 

Let  the  payoff  to  the  attacker  be  the  expectation  of  destroying  the 
correct  container;  then  the  game  is  zero-sum. 

One  fighter.  If  the  attacker  is  permitted  only  one  attack,  then  he  has 
two  strategies — attack  P  and  attack  F.  The  defender  also  has  two 
strategies — conceal  the  valued  object  in  P  and  conceal  the  valued  object 
in  F.  The  payoff  for  each  outcome  is  given  by  the  following  2X2  matrix. 


Hidden-Object  Payoff 


Conceal  Conceal 
in  P  in  F 

Attack  P  T  7  0  ”1 

Attack  F  (_  0  |8  J 


Solving  this  game,  we  obtain  that  the  value  to  the  attacker  is 


v  — 


Py 

P  +  y' 


The  optimal  strategies  are  always  mixed,  and  are  the  same  for  the  two 
players: 


play  P  with  probability 
play  F  with  probability 


P 


P  +  y 
y 

P  +  y 


Two  fighters.  If  the  attacker  is  permitted  two  attacks,  he  has  four  strate¬ 
gies  and  the  defender  has  two  strategies.  The  payoff  matrix  is  obtained  by 
computing  the  compound  probabilities  and  is  given  by 


P,P 
P,F 
F,  P 


F,F  Ip 


P 

'2y  -  y2 

y 

P2  -  py  +  y 


Py 
P 
P 
2  P 


P2J 


To  solve  this  4X2  game  we  note  that  for  0  <  y  <  P  <  1, 
y  <  P2  —  py  +  y.  Thus  the  second  row  is  dominated  by  the  third,  and 
may  be  eliminated.  A  further  examination  of  some  inequality  relation¬ 
ships  between  P  and  y  will  eliminate  the  fourth  row,  leaving  a  2  X  2  game 
to  be  solved.  The  solution  is  as  follows: 


(i)  If  P2  —  Py  +  72  —  y  >  0,  the  optimal  strategies  are,  for  the 
attacker, 
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y(l  —  6)  +7(1  —  -y) 
attack  P,  P  with  probability  ^  ^ 

attack  F,  P  with  probability  > 

and  for  the  defender, 

i8(l  —  7) 

conceal  in  P  with  probability  ^  ^  qp  ^ 

ft( 7  —  ft)  7(1  —  7) 
conceal  in  F  with  probability  ^  ^ 

The  value  to  the  attacker  is 


v  =  fty 


ft(y  -ft)  +  2(1  -  7) 

0(1  -  0)  +  7(1  -  7) 


(ii)  If  ft2  —  fty  +  t2  —  7  <  0,  the  optimal  strategies  are,  for  the 
attacker, 

attack  F,  P; 

and  for  the  defender, 

conceal  in  P. 


The  value  of  the  game  is 

v  =  ft2  —  fty  +  7. 

n  fighters.  A  typical  strategy  for  the  attacker  in  this  case  is  an  ordered 
set  such  as  P,  F,  F,  .  .  .  ,  P,  representing  his  target  choice  on  each  of  his 
n  passes,  provided  the  previous  passes  have  all  failed.  First,  we  note  that 
any  strategy  in  which  F  appears  exactly  r  times,  where  0  <  r  <  n,  is 
dominated  by  the  strategy 

Fr  =  F,  F, .  .  .  ,  F,  P,  P,  .  .  .  ,  P, 

with  r  F’s  followed  by  n  -  r  P’s.  This  means  that  if  the  attacker  plans  to 
make  r  attacks  on  F,  it  is  best  to  make  them  on  the  first  r  passes.  This 
is  the  reason  for  the  elimination  of  row  2  in  the  4X2  matrix  of  the 
two-fighter  case.  It  corresponded  to  the  strategy  P,  F. 

We  have  then,  strategies  Fr,  r  =  0,  1, .  .  . ,  n  for  the  attacker  and  P 
and  F  for  the  defender.  The  elements  a(r,  P)  and  a(r,  F)  of  the  (n  +  1)  X  2 
payoff  matrix  are  found  by  considering  the  ways  the  attacker  can  fail  to 
destroy  the  valuable  target.  Thus: 

a(r,  P)  =  1  -  (1  -  ftY(  1  -  y)r  -  r0(l  -  ft)n-\ 
o(r,  F)  =  1  -  (1  -  ftY(  1  -  7)"-r 

-  7(0  -  7)-X(l  -  0)r{(l  -  7)n“r  ~  (1  -  0)"-r}- 

The  game  is  solved  by  making  r  continuous  and  making  use  of  the 
concavity  in  r  (i.e.,  the  second  derivatives  are  negative). 
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Let  r0  be  the  solution  of 


and  let  R  —  | Then 
n  =  n 

The  solution  depends  on  the  value  of  r0.  If  a(r0,  P)  <  a(r0,  F),  then  the 
attacker  has  a  pure  optimal  strategy  r0  and  the  defender  also  has  a  pure 
optimal  strategy  P.  If  a(r0,  P)  >  a(r0,  F),  then  the  attacker  has  a  pure 
optimal  strategy  r  given  by  the  solution  of  the  equation 

rd(d  —  7)  _  /I  —  y\"~r 
7(1  —  /S)  \l-pj  ’ 

or  approximately  r  —  ny/(fi  +  y).  The  defender  has  an  optimal  strategy 
which  is  a  mixture  of  P  and  F. 

For  the  original  discrete  game,  define  the  integer  n  =  \ja\  and  solve 
the  2X2  matrix 

[  a(rh  P)  a(n,  F)  I 
La(n  +  1,  P)  a(n  +  1,  F)  J 

If  a(rh  P)  <  a(rh  F),  we  have  pure  strategy  solutions.  If  a(rl;  P)  >  a(rh  F), 
we  have  mixed  strategies  solutions. 

18.  SELECTING  A  PARTICULAR  OPTIMAL  STRATEGY 

If  a  player  has  many  optimal  strategies,  he  may  use  any  one  of  them 
and  he  can  expect  to  receive  at  least  an  amount  v,  the  value  of  the  game, 
regardless  of  the  strategy  used  by  his  opponent.  If  his  opponent  also  uses 
an  optimal  strategy,  then  the  player  can  expect  to  receive  exactly  this 
amount  v.  Thus  if  both  players  play  optimally,  there  is  no  reason  for 
preferring  one  optimal  strategy  over  another  except  for  reasons  extraneous 
to  the  game  model.  (Examples  of  extraneous  considerations  are  prefer¬ 
ences  for  pure  strategies  or  preferences  for  mixed  strategies  having  simple 
components.)  However,  if  we  permit  the  unlikely  possibility  that  an  op¬ 
ponent  may  make  some  mistake  and  fail  to  choose  one  of  his  optimal 
strategies  then,  by  selecting  a  particular  optimal  strategy,  a  player  can 
take  maximum  advantage  of  the  mistake. 

Among  the  solutions  for  each  player  there  exist  certain  strategies  which, 
in  addition  to  being  optimal,  take  maximum  advantage  of  the  mistakes, 
if  any,  of  one’s  opponent.  We  shall  describe  one  method  for  choosing  such 
an  optimal  strategy.  This  particular  optimal  strategy  will  be  chosen  on 
the  assumption  that  the  opponent  will  make  some  mistake,  unknown  in 


da(r,  P) 
dr 


=  0, 


In  8/ (1  —  j3)  —  In  In  P 
In  R 
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advance  to  the  player,  but  that  the  opponent  will  try  to  minimize  the 
result  of  such  mistake. 

Let  us  assume  that  Blue,  the  maximizing  player,  wishes  to  select  an 
optimal  strategy  which  takes  maximum  advantage  of  Red’s  possible  de¬ 
partures  from  his  set  of  optimal  strategies.  Let  the  payoff  matrix  be  A 
and  the  set  of  optimal  strategies  of  Blue  be  TX(A).  A  procedure  for  selecting 
a  particular  optimal  strategy  is  as  follows: 

(i)  Assume  that  Blue  uses  only  mixed  strategies  belonging  to  Ti(A). 
This  is  equivalent  to  replacing  the  maximizing  player’s  pure 
strategies  by  the  extreme  points  of  TX(A). 

(ii)  Assume  that  Red  uses  only  those  pure  strategies  which  yield  more 
than  the  value  of  the  game  to  Blue  for  at  least  one  optimal  strat¬ 
egy  of  Blue.  This  is  equivalent  to  deleting  from  Red’s  pure  strat¬ 
egies  all  pure  strategies  which  yield  the  value  of  the  game  against 
every  optimal  strategy  of  Blue. 

(iii)  Using  the  matrix  A,  compute  the  new  payoff  matrix  1  associated 
with  the  set  of  strategies  defined  in  (i)  and  (ii). 

(iv)  Find  the  optimal  strategies  Ti(A)  of  Blue.  It  is  evident  that  T l(A) 
is  contained  in  Ti(A).  If  Blue  uses  any  strategy  in  rI\(2).  he 
will  maximize  the  minimum  gain  for  any  departure  of  Red  from 
his  optimal  strategies. 

(v)  If  Ti(I)  is  not  unique,  the  above  process  may  be  repeated  for 
A  and  Ti(A).  The  process  mil  either  terminate  witha  single  strat¬ 
egy  for  Blue  or  every  element  of  the  payoff  matrix,  A  derived  from 
A  will  be  identical,  in  which  case  all  strategies  of  7\(Z)  are 
equivalent — they  all  take  maximum  advantage  of  mistakes. 

Example.  In  the  game  of  Morra  described  in  section  3  of  Chapter  1, 
we  had  the  following  9X9  payoff  matrix: 

"  0  2  2  -3  0  0  -4  0  0" 

-200033-400 
-2  00  -3  00044 

3030  -4  00  -5  0 

A  =  0  -3  04040  -5  0. 

0  -3  00  -4  0505 

44000  -5  00  -6 

0  0  -4  5  5  0  0  0  -6 

0  0  -4  0  0  -5  6  6  0_ 

The  value  of  this  game  is  zero.  It  can  be  verified  that  each  player’s  set 
of  solutions  has  the  following  four  extreme  points : 

(a)  =  (0,  0,  *,  0,  A,  0,  0,  0), 

(b)  =  (0,  0,  if,  0,  if,  0,  *,  0,  0), 
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(c)  =  (o,  o,  -n,  o,  a,  o,  if,  o,  o), 

(d)  =  (0,  o,  if,  0,  if,  0,  a,  0,  0). 

Replacing  Blue’s  nine  pure  strategies  by  the  four  extreme  points,  we 
obtain  the  following  4X9  matrix: 


B  = 


r  2 

TT 

0 

0 

1 

IT 

0 

1 

TT 

0 

0 

2 

TT 

4 

TT 

0 

0 

0 

0 

3 

TT 

0 

4 

TT 

10 

TT 

8 

4  7 

3 

4  7 

0 

0 

0 

0 

0 

5 

TT 

8 

4  7 

14 
1—6  1 

4 

6  1 

0 

5 

6  1 

0 

0 

0 

0 

4 

6  1- 

Deleting  from  Red’s  strategies  all  pure  strategies  which  yield  the  value 
of  the  game  against  every  optimal  strategy  of  Blue  (delete  columns  three, 
five,  and  seven),  we  obtain  the  4X6  matrix: 


I  = 


"  2 
TT 
4 

TT 


^  iV  iV  0  n 

no3  4  io 
u  u  TT  TT  TT 


8  3  0  0  5  8 

TT  4  7  u  u  47  TT 

0  0  A 


,14  4  5 

1—6  1  61  TgJ 


Solving  A,  we  find  that  Blue’s  set  of  solutions  has  two  extreme  points 


flgO  0  9  4  6  1 

vsvxi  v,  vt5> 

(  12  37  A  6  1  \ 

'  110 I  110!  U,  XlTT/’ 

In  terms  of  the  original  nine  strategies  of  Blue,  these  two  solutions  yield 
the  same  solution  of  A,  namely: 


(«)  =  (o.  o-  If-  o,  H,  o,  M.  o,  o). 

The  value  of  the  game  with  payoff  matrix  Z  is  ft.  Therefore  if  Blue 
uses  the  particular  optimal  strategy  (e),  he  will  gain  at  least  ff  if  Red  ever 
departs  from  an  optimal  strategy  in  the  original  game. 


GAMES  IN 
EXTENSIVE  FORM 


1.  REPRESENTATION  OF  GAMES 

Finite  games  can  be  conveniently  represented  with  the  help  of  topo¬ 
logical  trees.  We  shall  give  an  example  of  such  a  representation  for  the 
following  two-person  game.  Suppose  that  the  first  move  of  the  game  is 
made  by  Blue,  who  has  to  choose  from  among  four  alternatives.  We  rep¬ 
resent  this  situation  by  the  bottom  part  of  Fig.  10,  where  B  at  the  lowest 


point  of  the  figure  indicates  that  the  first  move  is  made  by  Blue,  and  the 
four  rising  lines  indicate  that  Blue  has  four  alternatives  at  this  move.  Let 
us  number  the  alternatives  at  this  first  move  (and,  indeed,  at  each  of  the 
succeeding  moves)  in  a  counterclockwise  sense  with  roman  numerals.  Thus 

74 
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alternative  I  at  the  first  move  is  the  one  corresponding  to  the  segment 
leading  to  the  vertex  marked  R  in  the  figure. 

Now  suppose  that,  if  Blue  chooses  I  on  this  first  move,  then  the  next 
move  is  to  be  made  by  Red,  who  in  turn  has  a  choice  from  among  three 
alternatives;  and  suppose  that  in  addition  it  is  given  that  if  Blue  chooses 
II  or  III  on  the  first  move,  then  the  next  move  is  made  by  Red,  who  in 
either  case  is  to  choose  from  two  alternatives.  This  is  indicated  by  putting 
an  R  at  the  end  of  the  segment  corresponding  to  the  first  choice  by  Blue, 
drawing  three  lines  upward  from  this  point,  putting  an  R  on  the  vertices 
corresponding  to  moves  II  and  III  of  Blue,  and  drawing  two  lines  upward 
from  each  of  these  points. 

Suppose,  moreover,  that  if  Blue  chooses  IV  on  the  first  move,  then  the 
next  move  is  made  by  a  chance  device  which  chooses  from  two  alternatives. 
Suppose  that  this  chance  device  is  such  that  it  assigns  probability  |  to  the 
first  chance  alternative,  and  f  to  the  second.  We  represent  this  in  Fig.  10 
by  putting  an  0  at  the  vertex  corresponding  to  the  move  made  by  chance ; 
and  putting  \  and  f  on  the  appropriate  lines  rising  from  this  vertex. 

Finally,  let  us  suppose  that  if  Blue  chooses  I  and  Red  chooses  III,  or 
if  Blue  chooses  II  and  Red  chooses  I,  then  the  next  move  is  to  be  made  by 
Blue  again;  that  he  then  has  a  choice  of  two  alternatives;  and  that  these 
are  the  only  possible  moves  in  the  game.  Then  the  tree  in  Fig.  10  represents 
the  complete  structure  of  the  moves  of  the  game. 

Any  unicursal  path  from  the  bottom  to  the  top  of  the  tree  represents 
a  play  of  the  game.  There  are  just  as  many  plays  as  there  are  top  points  of 
the  tree.  Thus  there  are  precisely  11  possible  ways  of  playing  the  game 
represented  in  Fig.  10.  For  each  of  these  ways  of  playing  the  game  there 
is  a  payoff  to  each  player. 

In  order  to  complete  the  description  of  the  game,  it  is  also  necessary  to 
specify  how  much  the  players  know  about  the  previous  choices  at  the  time 
they  make  their  moves.  Thus  suppose  that  when  Red  makes  his  choice  he 
does  not  know  whether  Blue  has  chosen  II  or  III  on  his  first  move ;  we  can 
indicate  this,  as  in  Fig.  10,  by  enclosing  the  two  corresponding  points  in 
a  region  bounded  by  a  broken  line.  We  call  such  a  set  of  vertices  among 
which  a  player  cannot  distinguish  when  he  makes  his  move  an  information 
set.  If  a  player  at  a  certain  point  is  completely  informed  about  the  past 
course  of  the  play,  the  corresponding  information  set  contains  only  the 
single  point,  in  which  case  we  omit  the  broken  lines. 

If  each  information  set  contains  just  one  vertex,  we  say  that  the  game 
is  one  with  perfect  information;  that  is,  each  player,  at  the  time  he  makes 
each  of  his  moves,  is  completely  informed  about  the  past  course  of  the  play. 

In  general,  a  finite  game  can  be  represented  as  above  by  a  finite  tree. 
The  partition  of  the  vertices  into  information  sets  is  subject  to  the  obvious 
restrictions  that  all  the  vertices  in  a  given  information  set  must  correspond 
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to  the  same  player,  and  must  present  the  same  number  of  alternatives. 
In  addition,  we  require  that  no  information  set  intersect  any  play  in  more 
than  one  point. 

A  strategy  for  a  player  will  be  a  function  which  is  defined  over  the  class 
of  his  information  sets,  and  for  each  information  set  picks  out  one  of  the 
available  alternatives.  Thus,  for  the  game  represented  in  Fig.  10,  let  a  be 
the  set  consisting  of  the  bottom  vertex  and  let  /3  be  the  set  consisting  of  the 
two  top  vertices;  then  a  strategy  for  Blue  is  a  function  /  such  that 

/(«)  e  { I,  II,  III,  IV}  and  /03)  e  { I,  II}. 

Thus  Blue  has  six  strategies  in  the  game.  For  example,  the  strategy 
(II,  I)  says  that  Blue  first  picks  his  second  alternative  and  on  his  second 
move  he  picks  his  first  alternative. 


2.  GAMES  WITH  PERFECT  INFORMATION— SADDLE-POINTS 

We  shall  now  examine  a  special  class  of  games — the  so-called  games 
with  “perfect  information.”  These  games  have  the  property  that  at  every 
point  in  every  play  each  player  whose  turn  it  is  to  move  knows  exactly 
what  choices  have  been  made  previously.  The  moves  are  made  alternately, 
which  means  that,  in  terms  of  the  graph,  each  information  set  consists  of 
one  element. 

Many  common  parlor  games  are  games  with  perfect  information. 
Ticktacktoe,  checkers,  chess,  backgammon,  for  instance,  are  games  with 
perfect  information. 

We  shall  show  that  the  payoff  matrix  of  any  zero-sum  two-person  game 
with  perfect  information  has  a  saddle-point — i.e.,  that  there  are  optimal 
pure  strategies  for  such  a  game. 

In  order  to  carry  out  the  proof,  it  is  convenient  to  introduce  the  notion 
of  the  truncation  of  a  game  with  perfect  information.  Truncations  are 
those  games  which  arise  from  a  given  game  if  the  first  move  is  deleted. 
The  number  of  truncations  is  the  number  of  alternatives  available  at  the 
first  move.  The  payoff  functions  for  these  truncated  games  are  the  original 
payoff  functions,  with  their  domains  of  definition  suitably  restricted.  We 
can  also  consider  truncation  of  a  strategy — it  picks  out  the  same  alterna¬ 
tives  at  the  branch  points  as  does  the  original  strategy. 

We  shall  prove  that  a  perfect-information  game  has  a  saddle-point  by 
an  induction  on  the  length  of  the  game — i.e.,  on  the  number  of  branch 
points  in  the  longest  possible  play  of  the  game. 

For  games  of  length  1  (i.e.,  games  with  one  move)  the  theorem  is  ob¬ 
vious.  Now  suppose  that  the  theorem  is  true  for  all  games  of  length  less 
than  fc.  Let  F  be  a  game  of  length  k.  Suppose  there  are  r  alternatives  on 
the  first  move,  and  let  Fj,  lb,  •  .  .  ,  Trbe  the  r  truncations  of  F.  Let  Ai  B)  be 
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Blue’s  set  of  pure  strategies  for  the  game  I\.  Let  AW)  and  Aw  be  Blue’s 
and  Red’s  set  of  pure  strategies,  respectively,  in  the  game  Y . 

By  the  induction  hypothesis,  there  is  a  saddle-point  in  each  of  the 
games  IV  For  each  i,  let  f*,  g\  be  a  saddle-point.  Let  Mt  be  the  payoff 
function  in  the  game  I\,  then  we  have 

f  Mm,  9t  >  Miifi,  gt 

4-1  I  Mm,  gt)  <  Mm,  Qi) 

for  i  =  1,2,  ...  ,r  and  for  ft  any  member  of  A[n)  and  g,  any  member 
of  AiR). 

We  need  to  distinguish  three  cases  according  to  who  makes  the  first 
move — Red,  Blue,  or  chance. 

Case  1.  The  first  move  of  F  is  a  chance  move.  Letting  q  lie  a  branch 
point  of  one  of  the  truncated  games  T,,  and  letting  it  correspond  to  a  move 
made  by  Blue,  we  set 

/*(?)  =  /*(?)• 

If  q  corresponds  to  a  move  of  Red,  then  set 

g*(q)  =  <?*(<?)• 

Since  the  first  move  is  made  by  chance,  it  is  clear  that  f*  is  defined  over 
every  branch  of  F  which  corresponds  to  a  move  made  by  Blue,  and  hence 
is  a  member  of  A(B).  We  shall  show  that  /*,  g*  is  a  saddle-point  of  T . 

Let  the  probabilities  assigned  to  the  r  alternatives  at  the  first  move  be 
oil,  at,  ,  ar.  Let  M  be  the  payoff  function  in  the  game  T .  Then,  if  /  and 
g  are  strategies  in  T, 

M{f,  g)  =  S  cuMiUu  9i)- 

In  particular,  since  /*, .  .  . ,  f*  are  truncations  of  /*  and  g*, ...  ,g*  are 
truncations  of  g*,  we  have 


(4.2) 

and 


r  MU,  g*)  =  s  aMiUi,  (A) 
[MU*,  g)  =  2  atMm,  Qi) 


(4.3)  MU*,  g*)  =  2  ViMiUl  Q*)- 
From  (4.1)  and  (4.2)  it  follows  that 

2  aMiUi,  gt)  <  2  aMiUl  gt  =  MU*,  9*), 
2  aMiUl  Qi)  >  2  *Mm,  gt  =  M(f*,  g*). 

Hence  (/*,  g*)  is  a  saddle-point  of  T. 

Case  2.  The  first  move,  go,  of  F  is  made  by  Blue.  Let 

(4.4)  max  Mi{f*,  Q*t  =  M^ft  Qt- 

i  <r 
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Now  define  a  function  /*  by  setting 

(4.5)  f*(q0)  =  n  and  f*(q)  =  f*t(q ) 

for  any  point  q  in  the  truncated  games  which  correspond  to  a  move 
made  by  Blue.  We  define  g*  in  the  same  way  as  in  the  previous  case. 

It  is  clear  that  /*  and  g*  are  strategies  in  T.  We  shall  show  that  they 
yield  a  saddle-point  of  r. 

If  g  is  any  strategy  for  Red  in  F,  and  if  <7,  is  its  truncation  to  r„  then 
M(f*,  g )  =  MMl,  fir,). 

In  particular,  we  have 

(4-6)  M(f*,  g*)  =  Mtfl,  g*). 

Thus,  if  g  is  any  strategy  for  Red  in  T,  and  if  fir,  is  its  truncation  to  T„ 
we  see  that 

M(f*,  fir*)  =  Jf,(£,  gt)  <  M,(/*,  g,)  =  M(/*,  g). 

Now  let  /  be  any  strategy  for  Blue  in  F,  and  suppose  that  /  picks  out 
the  ith  alternative  on  the  first  move,  i.e.,  suppose  that 

/(r?o)  =  i- 

Let  fi  be  the  truncation  of  /  to  F,.  Then  we  see  that,  if  g  is  any  strategy 
for  Red  in  T,  and  if  gt  is  its  truncation  to  r„ 

M(f,  g)  =  Mi (fi}  g,). 

In  particular, 

M{f,  g*)  =  ¥,(/;,  gt). 

But,  from  (4.4)  we  have 

g*)  >  fi). 

Hence,  we  can  conclude  from  (4.6)  that 

M(/*,  fir*)  =  M,(/*,  ff*)  >  Mi(fl  fi)  >  Miifi,  gt)  =  M(f,  g*). 

We  see  that  (/*,  g*)  is  a  saddle-point  as  was  to  be  shown. 

Case  8.  The  first  move  of  r  is  made  by  Red.  This  is  analogous  to  the 
preceding  case. 


5  METHODS  OF 

SOLVING  GAMES 


1.  SOLVING  FOR  OPTIMAL  STRATEGIES 

From  the  fundamental  theorem  of  game  theory,  the  minimax  theorem, 
it  follows  that  each  player  has  an  optimal  strategy.  Using  an  optimal  strat¬ 
egy  (generally  a  mixed  strategy),  a  player  can  expect  to  win  a  fixed  amount 
(the  game  value)  regardless  of  the  strategy  selected  by  his  opponent,  and 
this  fixed  amount  is  as  large  as  is  strategically  possible.  Of  course,  he  may 
win  more  than  this  fixed  amount  from  his  opponent  if  his  opponent  does 
not  use  an  optimal  strategy.  In  order  to  protect  himself  against  this  un¬ 
necessary  loss,  the  opponent  will  also  use  an  optimal  strategy.  We  have 
referred  to  a  pair  of  optimal  strategies,  one  for  each  player,  as  a  solution  of 
the  game. 

The  determination  of  all  solutions  of  a  game — or  of  even  a  single  solu¬ 
tion — is  generally  a  lengthy  computing  task.  As  might  be  expected,  the 
volume  of  computations  increases  with  the  number  of  pure  strategies 
available  to  the  two  players,  or  the  number  of  elements  in  the  payoff 
matrix.  If  at  least  one  player  has  2  or  3  pure  strategies,  it  is  possible 
to  represent  the  strategies  in  two  dimensions  and  then  obtain  a  graphical 
solution. 

We  shall  discuss  various  methods  of  obtaining  some  solution  of  a  game. 
We  shall  also  describe  methods  for  getting  all  solutions.  Single  solutions 
will  be  obtained  either  by  guessing  a  solution  and  then  verifying  it,  or  by 
successive  approximations.  To  obtain  all  solutions  we  shall,  in  effect,  solve 
a  large  number  of  linear  systems  of  equations. 

Since  the  volume  of  computing  required  to  obtain  a  solution  depends 
on  the  number  of  strategies,  it  is  important  to  reduce  this  number  when¬ 
ever  possible.  The  dominance  criteria  discussed  in  the  previous  chapter 
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can  be  applied  for  this  purpose.  It  will  be  recalled  that  elimination  by 
nonstrict  dominance  may  lose  some  solutions  but  elimination  by  strict 
dominance  preserves  all  solutions. 

We  have  already  described  one  method  for  getting  a  solution — namely, 
the  simplex  algorithm.  We  shall  now  discuss  five  other  methods  of  solving 
a  game.  They  are :  guess  and  verify,  examination  of  submatrices,  successive 
approximations,  graphical,  and  mapping. 

2.  GUESS  AND  VERIFY 

Given  a  payoff  matrix  with  a  large  number  of  elements  we  first  examine 
it  for  pure  strategy  solutions  or  saddle-points.  This  can  be  done  very  easily. 
In  order  that  a  game  have  a  pure  strategy  solution,  there  must  exist  an 
element  in  the  payoff  matrix  which  is  simultaneously  the  minimum  of  its 
row  and  the  maximum  of  its  column.  If  no  such  element  exists  there  are 
no  pure  strategy  solutions. 

Example.  The  game  defined  by 

Min  of  row 
"  3  7  4  S']  3 

13  10  7  8  7 

10  4  1  9  1 

_  3  5  6  7j  3 

Max  of  column  13  10  7  9 

has  a  pure  strategy  solution — Blue’s  second  strategy  and  Red’s  third 
strategy — and  the  value  of  the  game  is  7,  since  this  element  is  the  minimum 
of  row  2  and  the  maximum  of  column  3. 

If  the  game  does  not  have  a  pure  strategy  solution,  i.e.,  a  saddle-point, 
and  therefore  has  a  mixed  strategy  solution,  it  may  be  possible  to  guess 
a  solution  or  the  form  of  a  solution.  Some  idea  of  the  form  of  the  solution — 
e.g.,  the  pure  strategies  which  must  be  in  the  solution — can  be  obtained 
from  the  particular  game  being  solved  or  from  the  elements  in  the  payoff 
matrix. 

If  an  analysis  of  the  background  of  the  game  provides  a  guess  of  a 
solution  of  the  game,  we  can  verify  it  very  readily.  In  order  for  Xo,  F0  to 
be  a  solution,  they  must  satisfy 

min  XoA  Y  =  max  X'A  Y0  =  X'0A  Y0. 

Y  X 

Example.  Suppose  that  the  payoff  matrix  is  given  by 
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We  guess  that  each  player  wishes  to  avoid  using  his  third  strategy,  since 
that  strategy  can  cause  a  loss  of  as  much  as  5  but  a  gain  of  at  most  2. 
We  therefore  guess  that  x3  =  2/3  =  0  and  then  we  need  consider  only  the 
smaller  game  defined  by  the  2X2  matrix, 

[-1  'I) 

It  is  obvious  that  this  smaller  game  has  for  its  solution  £1  =  £s  —  |, 
Vi  =  y%  —  i-  Since  for  the  original  game  we  have 

min  XoA  Y  =  max  X'A  Fo  =  0, 

Y  X 

it  follows  that  we  have  arrived  at  a  solution  to  the  game. 


3.  EXAMINATION  OF  SUBMATRICES 

The  method  of  examination  of  submatrices,  as  discussed  in  Chapter  3, 
provides  all  the  solutions  of  a  game  by  obtaining  certain  basic  solutions 
and  then  expressing  every  solution  as  a  linear  combination  of  these  basic 
solutions.  The  method  consists  of  testing  each  square  submatrix  of  the  pay¬ 
off  matrix  for  all  strategies  active,  and  then  testing  an  expanded  vector  for 
a  solution  of  the  full  matrix. 

Let  A  =  (oy)  be  the  payoff  matrix  having  m  rows  and  n  columns. 
Consider  some  r  X  r  submatrix  B  —  (fin)  of  A,  where  r  =  1,  2, .  .  . , 
min  (m,  n).  Let  us  test  whether  this  subgame  has  all  strategies  active  (i.e., 
whether  there  exists  some  Xo,  F0  such  that  Xo  BY  =  X'BY0  =  Xo  BY  a  for 
all  X  and  F).  This  can  be  done  (as  was  shown  in  Chapter  3)  by  construct¬ 
ing  the  adjoint  matrix  J  =  (Bjt)  of  the  matrix  B  and  then  computing  all 

the  row-sums  2  BJt  and  column-sums  2  Bi%-  If  all  2r  sums  are  non-negative, 

i  » 

or  all  nonpositive,  then  the  pair  of  r-component  vectors 

2  Bji  2  Bn 

Xi  =  2  2  Bj-  Vi  =  2  2  Bj- 

i  j  i  3 

provide  a  solution  of  the  subgame  B. 

It  is  next  necessary  to  test  whether  the  solution  of  the  subgame  B  is  a 
solution  of  the  game  A.  Introduce  zero  components  for  the  m  —  r  rows  and 
n  —  r  columns  of  the  matrix  A  which  did  not  appear  in  submatrix  B.  This 
yields  the  full  vectors  X  =  (x,),  F  =  (y,).  Test  whether  the  full  vectors 
X  and  F  solve  A — that  is,  whether 

n  m 

max  2  diiVi  —  min  2 


82  METHODS  OF  SOLVING  GAMES 

If  X,  Y  solves  A  then  it  is  said  to  be  a  basic  solution  of  A  and  the  value 
of  the  game  is 

|B|  .  m 

v  =  max  2  aiflj  =  '  =  mm  2  a.yz.- 

i  3  Zj  2j  j  t  =  l 

i  3 

If  this  procedure  is  applied  to  every  square  submatrix  of  A  we  shall 
obtain  all  the  basic  solutions  of  A.  Every  basic  solution  of  A  is  produced 
by  some  square  submatrix  of  A.  Since  there  are  a  finite  number  of  sub¬ 
matrices  of  A  there  will  be  a  finite  number  of  basic  solutions. 

All  solutions  of  A  are  obtained  by  taking  all  possible  convex  linear 
combinations  of  the  basic  solutions.  Thus  if  X\,  X2,  .  .  .  ,  Xi  are  the  l 
basic  solutions  of  Blue,  then  all  his  solutions  are  given  by 

ai-Xi  +  0:2X2  -J-  •  •  •  T  ociXi, 

1 

where  at  >  0  and  2  a<  =  1. 

i  =  l 

4.  SUCCESSIVE  APPROXIMATIONS 

It  is  possible  to  obtain  the  value  and  a  solution  of  a  given  game  by  a 
relatively  simple  method  of  successive  approximations.  The  method  re¬ 
quires  only  two  operations:  location  of  the  maximum  or  minimum  of  a 
discrete  set  of  numbers,  and  addition. 

Given  a  game  defined  by  a  payoff  matrix  A  =  («;,-) ,  whose  solution  is 
unknown,  then  one  way  of  determining  an  optimal  strategy  is  to  play  the 
game  many  times,  each  time  selecting  that  pure  strategy  which  is  best 
against  the  opponent’s  total  performance  to  that  play.  The  relative 
frequencies  of  these  strategies  will  yield  an  approximate  solution  to  the 
game. 

The  method  can  best  be  illustrated  by  an  example.  Suppose  we  are 
given  the  game  defined  by  the  payoff  matrix 


m 

s 

0 

2 

1 

0“ 

© 

2 

0 

3 

© 

_-i 

3 

—  3_ 

where  the  numbers  in  circles  represent  Blue’s  strategies  and  those  in 
squares,  Red’s  strategies.  Assuming  that  Blue  begins  the  series  of  plays  by 
selecting  ®,  the  successive  approximations  are  shown  by  Table  1.  The 
symbols  in  the  table  have  the  following  meaning:  N  is  the  number  of  the 
play;  i(N)  is  the  pure  strategy  chosen  by  Blue  for  the  Nth  play;  K-i(N)  is 
the  total  receipts  of  Blue  after  N  of  his  plays  if  Red  used  his  pure  strategy 
[1]  constantly,  and  similarly  K2{N )  and  /G(A’) ;  v(N)  is  the  least  that  Blue 
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Table  1.  Successive  Approximations 

N  i(N)  Ki(N)  K,(N)  K3(N )  v(N)  j(N)  Ih(N)  H,(N)  HS(N )  v(N)  v(N)  -  v(N) 


1 

2 

1 

0 

.000 

0 

0 

3 

-3 

3.000 

3.000 

2 

© 

4 

1 

3 

.500 

0 

1 

3 

0 

1.500 

1.000 

3 

© 

6 

1  h-*  ! 

6 

.333 

0 

2 

3 

3 

1.000 

.667 

4 

© 

8 

1 

9 

.250 

0 

3 

3 

6 

1.500 

1.250 

5 

© 

7 

4 

6 

.800 

0 

4 

3 

9 

1.800 

1.000 

6 

© 

6 

7 

3 

.500 

0 

4 

6 

6 

1.000 

.500 

7 

© 

8 

7 

6 

.857 

0 

4 

9 

3 

1.286 

.429 

8 

© 

10 

7 

9 

.875 

0 

5 

9 

6 

1.125 

.250 

9 

© 

12 

7 

12 

.778 

0 

6 

9 

9 

1.000 

.222 

10 

© 

14 

7 

15 

.700 

0 

7 

9 

12 

1.200 

.500 

11 

© 

13 

10 

12 

.909 

0 

8 

9 

15 

1.364 

.455 

12 

© 

12 

13 

9 

.750 

0 

8 

12 

12 

1.000 

.250 

can  expect  to  receive,  on  the  average,  after  N  of  his  plays;  j(N)  is  the  pure 
strategy  chosen  by  Red  for  his  Nth  play;  IIi(N)  is  the  total  receipts  of 
Blue  after  N  plays  of  Red  against  the  constant  strategy  ©  of  Blue,  and 
similarly  H2(N)  and  H3(N)]  v(N)  is  the  most  that  Blue  can  expect  to 
receive  on  the  average,  after  N  plays  of  Red. 

v(N)  —  j  min  Kj(N)  and  v(N)  =  j  max  //,(iV). 

Table  1  has  been  completed  as  follows:  For  the  first  play  of  the  game, 
assume  that  Blue  chooses  ®.  Then  Blue  will  receive  2,  1,  or  0  depending 
on  whether  Red  chooses  0,  0,  or  0.  Red  will  therefore  choose  0  for  his 
first  play,  since  that  minimizes  Blue’s  receipts;  and  Blue  will  thus  receive 
0,  3,  or  —3.  For  the  second  play,  Blue  will  choose  ©  since  that  maximizes 
his  receipts  against  Red’s  first  play.  Thus  after  two  plays,  Blue  has  received 
a  total  of  4,  1,  or  3  depending  on  whether  Red  chooses  0,  0,  or  0.  Red 
will  therefore  choose  0  since  that  minimizes  Blue’s  receipts  for  N  —  2, 
and  makes  Blue’s  receipts  total  1,  3,  or  0,  depending  on  whether  Blue 
chooses  ®,  ©,  or  ®.  We  obtain 

v(2)  =  |  =  0.50  and  v(2)  =  §  =  1.50. 

The  process  is  identical  for  all  successive  N. 

To  obtain  an  approximation  to  an  optimal  strategy,  we  determine  the 
relative  frequencies  of  each  of  the  pure  strategies  in  the  table.  Thus  at 
N  =  12,  we  have 

X  =  Cnj>  T2I  A);  X  —  -j^-,  xi). 

The  value  of  the  game  is  approximated  by  v{N)  and  v(N).  Thus  at  N  =  12, 
the  value  of  the  game  is  between  0.75  and  1.00. 
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To  solve  a  3  X  3  game,  we  plot  the  rows  and  columns  separately  on 
one  of  these  systems  of  coordinates  in  two  dimensions  and  consider  the 
two  triangles  formed  by  the  two  sets  of  three  points.  If  both  triangles 
contain  the  origin,  then: 

(a)  The  game  has  a  solution  in  which  all  strategies  are  active. 

(b)  The  solution  is  unique  and  the  game  is  completely  mixed  if  and 
only  if  the  origin  is  actually  interior  to  both  triangles. 

(c)  The  positions  of  the  origin  with  respect  to  triangular  coordinates 
will  determine  the  weights  which  each  player  should  attach  to  his 
strategies.  For  example,  in  Fig.  11,  plotted  in  triaxial  coordinates, 
strategy  ©  receives  a  weight  OBJ AOB. 


If  all  strategies  are  not  active,  further  information  may  be  obtained 
from  the  triangle  not  having  the  origin  in  its  interior  (possibly  both 
triangles) : 

(a)  Construct  any  line  through  the  origin  so  that  the  triangle  lies 
entirely  in  one  half-plane. 

(b)  The  vertices  of  the  triangle  correspond  to  the  strategies  of  one 
player.  Represent  those  of  the  other  player  by  any  set  of  three 
points  on  the  positive  axes  of  the  triaxial  system,  for  example 
the  points 

(2,  -1,  —  1),  (—1,2,  — 1),  (— 1,  -R2). 

In  triangular  coordinates,  use  the  three  vertices 

(1,  0,  0),  (0,  1,  0),  (0,  0,  1). 


METHODS  OF  SOLVING  GAMES  87 

(c)  Then  some  combination  of  the  strategies  which  lie  in  the  same 
half-plane  with  the  triangle  dominates  some  combination  of  those 
in  the  opposite  half-plane.  If  open  half-planes  can  be  used,  then 
the  dominance  is  strict. 

We  can  interpret  the  weak  dominance  to  mean  that  the  dominating 
row  (dominating  column)  must  appear  in  the  solution,  while  at  least  one 
of  the  other  two  may  be  excluded. 

Exam-pie.  Let  the  payoff  matrix  be  given  by 

m  a  a 

®r-i  3  -3" 

©  2  0  3. 

®  L  2  1  0_ 

The  triaxial  representation  of  the  strategies  are  as  follows: 

©  =  (-2,  10, -8),  0  =  (-6,3,3), 

©=(1, -5,4),  0  =  (5,  -4,-1), 

®=(3,0, -3),  0  =  (-9,9,0). 

Plotting  the  strategies  on  triaxial  coordinates,  we  obtain  the  two  triangles 
®  ©  ®  and  0  a  0  (Fig-  12).  The  triangular  positions  of  the  origin  are 


Figure  12 


(h  f,  0)  and  (l,  §,  }).  The  left-hand  figure  shows  that  0  dominates  a 
combination  of  0  and  0,  but  not  strictly.  Therefore  some  solution  ex¬ 
cludes  0  and  is  (|,  f,  0),  (0,  f,  g).  The  value  of  the  game  is  1, 
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If  the  matrix  is  n  X  3,  the  rows  may  be  represented  as  n  points  in  a 
triaxial  system.  The  resulting  configuration  will  be  useful  in  picking  out 
which  triples  of  columns  to  test  for  a  3  X  3  solution. 

6.  MAPPING  METHOD  FOR  SOLVING  GAMES  WITH  CONSTRAINTS 

In  solving  for  an  optimal  strategy  of  a  game  we  assume  that  each  pure 
strategy  may  be  played  with  any  probability  between  zero  and  one.  Aside 
from  the  influence  of  the  payoff  matrix,  we  have  no  preference  of  strategies 
or  any  desire  to  avoid  certain  strategies,  whether  pure  or  mixed.  Now  in 
some  situations  we  may  wish  to  assure  that  a  certain  pure  strategy  is  played 
at  least  a  certain  per  cent  of  the  time,  or  that  we  play  a  given  pure  strategy 
no  more  frequently  than  some  other  given  strategy.  Such  a  restriction 
might  arise  from  the  ease  or  difficulty  of  playing  certain  strategies  or  from 
the  dependence  of  strategies.  If  we  restrict  the  choice  of  strategies  in  any 
way,  we  refer  to  the  game  as  a  game  with  constraints. 

We  have  previously  shown  that  X*,  Y*  is  a  solution  of  the  game  defined 
by  the  matrix  A  —  (a,-,)  if  and  only  if 

max  X'A  Y*  =  minX*'AF  =  X*'AY*. 
x  y 

If  the  game  has  constraints  then  the  preceding  conditions  are  also  necessary 
and  sufficient  to  define  a  solution,  where  X  and  F  are  now  the  constrained 
sets  of  strategies. 

The  solution  X*,  F*  of  an  arbitrary  game  with  constraints  has  the 
property  that  it  simultaneously  maximizes  X' AY*  with  respect  to  X  and 
minimizes  X*rAY  with  respect  to  F.  If  we  consider  the  set  of  strategies  X 
and  the  set  of  strategies  F,  then  X*  is  some  point  of  X  such  that  the  maxi¬ 
mum  of  X'A  F*  is  assumed  at  X*,  and  F*  is  some  point  of  I  such  that 
the  minimum  of  A*,A  F  is  assumed  at  Y*.  It  is  this  property  that  enables 
us  to  find  graphically  the  solution  of  a  game  with  constraints. 

The  method  of  solution  consists  of  partitioning  the  sets  of  strategies  X 
and  F  by  means  of  the  payoff  function.  For  each  partition  of  X  we  shall 
associate  some  subset  Fi  of  F.  Similarly,  for  each  partition  of  F  we  shall 
associate  some  subset  Xi  of  X.  Xi,  Fi  will  be  a  solution  if  A'\  is  associated 
with  F i  and  F i  is  associated  with  Xi. 

The  method  can  best  be  described  by  means  of  an  example.  Let  the 
payoff  matrix  be 


m 

0 

0 

'  3 

39 

30" 

© 

33 

9 

0 

© 

28 

4 

25 
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We  wish  to  find  the  solutions  of  this  game  such  that  Blue  has  the 
following  constraints: 

tV  <  *1  <  i, 

To  fs  £2  < 

*3  >  1(1  -  V*0- 

This  implies  that  ©  must  be  used  at  least  ten  per  cent  of  the  time  but 
no  more  than  eighty  per  cent  of  the  time ;  ©  must  be  used  at  least  five  per 
cent  of  the  time  but  no  more  than  fifty  per  cent  of  the  time ;  the  frequency  of 
using  strategy  ®  depends  on  ©,  namely  x3  >  f  (1  —  ©Xi)  ■  Of  course, 
+  x2  +  x3  =  1,  always. 

Let  us  also  assume  that  Red  has  the  following  constraints: 

To  <  2/2  <  2 yh  y3  > 

The  constraints  of  the  first  player  imply  that  he  picks  only  those 
mixed  strategies  lying  within  or  on  the  pentagon  ABODE  in  Fig.  13.  The 


second  player  picks  only  those  mixed  strategies  lying  within  or  on  the 
triangle  XYZ.  The  boundaries  of  the  pentagon  and  triangle  are  determined 
by  the  constraints.  Now  xh  x2,  1  —  xx  —  x2  and  yi,  y2,  1  —  yx  —  y2  are  the 
probabilities  of  playing  strategies  ®,  ©,  ©  and  [j],  ©,  ©,  respectively. 
The  expected  payoff  is 

<t>  =  (10x2  -  10xi  +  l)yi  +  (10x2  +  10xi  -  7)y2  +  i(^  -  25x2  +  25) 

=  (-IO2/1  +  IO2/2  +  |)xi  +  (IO2/1  +  102/2  -  ¥)*2  +  (2/i  -  7y2  +  $?). 
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Let  us  rewrite  the  expectation  $  as  follows: 

<t>  —  piyi  +  P2Z/2  +  Pi 

—  qiXi  +  qix  2  +  q%, 

where  pi  =  10x2  —  10xi  +  1,  51  =  — 10?/ 1  +  10?/ 2  +  -f,  etc. 

Now  pi  =  0,  pi  =  0  divide  the  pentagon  ABODE  into  four  regions: 
OLABM,  wherein  pi  >  0,  p2  >  0;  OMCDN,  wherein  pi  <  0,  p2  >  0;  OEN, 
wherein  pi  <  0,  p2  <  0;  OEL,  wherein  pi  >  0,  p2  <  0.  Similarly,  gi  =  0, 
q-i  —  0,  divides  the  triangle  XYZ  into  two  regions:  WZXV,  wherein 
qi  >  0,  qz  <  0;  VWY,  wherein  q%  <  0,  g2  <  0. 

For  each  point  in  the  region  OLABM  the  expectation  4>,  as  a  function 
of  Red’s  strategies,  assumes  the  minimum  at  point  Z  in  the  triangle  XYZ. 
However,  for  the  point  Z,  the  expectation  <t>,  as  a  function  of  Blue’s  strate¬ 
gies,  assumes  its  maximum  at  point  D  on  the  pentagon  ABODE,  and 
outside  the  region  OLABM.  Therefore,  no  point  inside  region  OLABM 
can  provide  a  solution  of  the  game. 

Consider  the  point  O,  defined  by  pi  =  0  and  p2  =  0,  in  the  pentagon. 
For  this  point  <j>  assumes  its  minimum  at  every  point  in  the  triangle  XYZ. 
In  particular,  the  minimum  is  assumed  at  V.  The  point  V  is  characterized 
by  qi  =  qi  =  0.  For  the  point  V,  the  maximum  value  of  4>  is  assumed  at 
every  point  of  the  pentagon  and  hence  at  O.  Therefore  the  points  O  and  V 
are  solutions  of  the  game  with  constraints.  These  points  are  the  intersection 
of  pi  =  0  with  p2  =  0  and  the  intersection  of  gr  =  0  with  g2  =  0.  O  is 
given  by 

*1  =  rtr>  xi  ts>  x?‘  ~  To- 

V  is  given  by 

2/i  =  f ,  2/2  =  f ,  2/s  =  I- 

There  are  additional  solutions  and  they  can  also  be  found  by  the  same 
procedure.  Consider  the  points  in  region  OEN  having  the  property  that 
Pi  =  P2  <  0.  This  determines  the  line  Xi  =  f,  x2  <  /o-  For  each  point 
on  this  line  the  minimum  value  of  <j>  is  assumed  at  each  point  of  XY.  In 
particular,  the  minimum  is  assumed  at  V.  But  if  Red  uses  strategy  V, 
then  <t>  will  assume  its  maximum  at  every  point  in  the  pentagon,  and  in 
particular  on  the  line  Xi  =  f,  x2  <  Therefore  this  line  is  a  solution  of 
the  game.  It  can  be  verified,  by  this  procedure,  that  there  are  no  other 
solutions. 

We  have  found  that  the  solution  of  this  game  is: 

Blue :  X\  —  ■§■,  —  x%  —  tsj  xz  =  1  X\  x2. 

Red:  pi  =  h  p2  =  i  2/3  =  I- 

The  value  of  the  game  is  6.5. 


METHODS  OF  SOLVING  GAMES  91 

7.  MAPPING  METHOD  FOR  SOLVING  GAMES 

Of  course  this  mapping  or  fixed-point  method  can  be  used  to  solve 
games  without  constraints.  For  a  3  X  3  game  the  strategy  space  is  a  right 
triangle  for  each  player.  For  example,  let  the  game  be  defined  by  the 
following  payoff  matrix: 

"-3  2  0" 

A  -  0  12. 

1  2  1_ 

The  expectation,  <£,  as  a  function  of  a  mixed  strategy  of  each  player  is 

$  ~  (  —  3a?i  2x^)y\  +  ( Xi  —  2x2  d-  1)2/2  d~  (  —  Xi  x%  1) 

=  (  32/i  +  y2  —  l)xi  +  ( —  2t/i  —  22/2  +  1)2:2  ~b  (2/2  +  1) 

=  Pm  +  Pm  +  Pa 

—  Q1X1  +  q2X2  +  §3, 

where 

Pi  =  —  3xi  —  2^2,  <JT  =  —Syi  -J-  2/2  —  1, 

P2  =  xi  2x2  +  1,  52  =  — 2yi  —  2y2  -f-  1, 

Pa  =  —xi  +  x2  +  1,  ?3  =  2/2  +  1. 

In  the  Fig.  14,  the  lines  pi  —  0  and  p2  =  0  divide  the  strategy  space 
XWZ  of  Blue  into  two  regions  XYT  and  YTWZ.  Similarly,  the  lines 


o 


Figure  14 


92  METHODS  OF  SOLVING  GAMES 

<?i  =  0,  g2  =  0  divide  Red’s  strategy  space  ABD  into  two  regions  ABCE 
and  CDE.  The  regions  are  characterized  as  follows: 

XYT:  pi  <  0,  p?  <  0 
YTWZ:  pi  <  0,  p^  >  0 
ABCE:  qi  <  0,  q2  <  0 
CDE:  qi  <  0 ,  q2  >  0. 

Consider  the  point  Z  for  which  pi  =  0  and  p2  =  1.  For  this  point  the 
minimum  of  <t>  is  assumed  at  i/2  =  0  or  the  line  AD.  In  particular  the  mini¬ 
mum  is  assumed  at  every  point  of  the  line  EA.  Now  each  point  on  line 
EA  has  <]i  <  0,  q2  <  0,  for  which  the  maximum  value  of  4>  is  assumed  at 
Xi  =  0,  x2  —  0,  or  at  the  point  Z.  Therefore  the  point  Z  is  a  solution  for 
Blue  and  the  line  EA  is  a  solution  for  Red. 

In  terms  of  the  strategies,  the  solutions  are: 

for  Blue :  X\  =  0,  x2  =  0,  x3  =  1 

for  Red:  i  <  yi  <  1,  2/2  =  0,  y3  =  1  -  yi  -  y%. 

The  value  of  the  game  is  1. 

8.  SOLUTION  OF  RECONNAISSANCE  GAME  BY  MAPPING  METHOD 

In  section  15  of  Chapter  3  we  showed  how  the  problem  of  reconnais¬ 
sance  can  be  considered  as  a  game  of  strategy.  We  considered  a  particular 
reconnaissance  game  which  yielded  the  payoff  matrix 

"46  22  39  27" 

10  34  39  27 
55  31  48  24 
_19  43  12  36_ 

We  listed  the  optimal  strategies  without  showing  how  they  were  computed. 
Using  the  mapping  or  fixed-point  method,  we  shall  now  compute  the 
optimal  strategies. 

If  Blue  uses  a  mixed  strategy 

X'  =  (xi,  x2,  xh  1  —  Xi  -  x2  —  x3) 
and  Red  uses  a  mixed  strategy 

Y'  =  (2/1, 2/2,  Vh  1  -  2/i  -  2/2  -  2/s), 

then  the  expectation  of  Blue  for  the  above  matrix  may  be  written  as 
<t>  =  ViVi  +  P22/2  +  ViVz  +  Pi 
—  QiXi  +  q2Xi  +  q3x3  +  qi, 
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where 

Pi  =  36ti  +  48x3  —  17  <h  =  367/!  —  12 t/2  +  36t/3  —  9 

Pi  —  —  12zi  +  7  52  —  36^/3  —  9 

p3  =  36a'!  +  36x2  +  48x3  -  24  g3  =  48 yx  +  48t/3  -  12 

Pa  —  9xi  9x2  12x3  — j—  36  <p  =  — 17?/i  -f-  7 2/2  —  242/3  -t~  36. 

Introduce  the  following  notation : 
if  pi  >  0,  then  write 

Pi  —  +> 

if  qi  <  0,  then  write 

Si  = 

if  p3  <  p2  <  pi  <  0,  then  write 

Pi  =  -,  P2  = - ,  p3  = - , 

if  pa  >  pi  >  pi  >  0,  then  write 

Pi  —  +>  P2  =  +  +,  Ps  =  +  +  +• 

For  any  point  in  the  strategy  space  of  Blue  for  which  px  =  +  ,  p2  =  +, 
Pa  —  +,  the  minimum  value  of  </>  is  assumed  at  the  point  2/1  =  y<i  =  ya  =  0. 
For  the  latter  point  in  Red’s  strategy  space,  we  have 

3i  =  -9,  q2  =  -9,  qa  =  -12, 
or 

Qi  —  qa——,  qa  —  — . 

For  these  values  of  qi,  the  maximum  value  of  4>  is  assumed  at  xi  =  x2  — 
x3  =  0  which  yields 

Pi  =  -,  Pa  =  +,  Pa  = 

Therefore  no  point  in  Blue’s  strategy  space  for  which  pi  >0,  p2  >  0, 
Pa  >  0  can  provide  a  fixed  point  or  a  solution  of  the  game.  In  a  similar 
way  we  map  each  of  the  possible  regions  in  the  space  of  strategies  of  Blue. 
Table  2  summarizes  the  mapping  of  the  entire  space  of  Blue’s  strategies. 

Using  the  table,  we  note  that  the  region  defined  by  pi  >  0,  p2  >  0, 
p3  =  0  maps  back  into  itself.  It  follows  that  the  optimal  strategies  for 
Blue  are  given  by  the  convex  set  X(A)  defined  by 

36xi  +  48x3  -  17  >  0, 

-12xx  +  7  >  0, 

36xi  +  36x2  +  48x3  —  24  =  0, 
xi  >0,  x2  >  0,  x3  >  0,  X4  =  1  —  Xi  —  x2  —  x3. 

From  the  table,  it  is  seen  that  Red  has  a  unique  optimal  strategy  given  by 

2/i  =  2/2  =  0,  2/3  =  2/4  =  f. 


Table  2.  Solution  of  Reconnaisance  Games  by  Mapping  Method 


(1) 

If  Blue’s  strategy 
is  such  that 

Pi  Pi  Vz 

(2) 

Using  (1),  expectation, 

<j>,  assumes  its 
minimum  at 

Vi  Vi  Vz 

(3) 

Using  (2), 

value  of  coefficients  qi  at  minimum 

9i  qt  ?s 

(4) 

Using  (3),  expecta¬ 
tion,  <t>,  assumes  its 
maximum  at 

Xt  Xi  x 3 

(5) 

Using  (4),  value  of 
coefficients  pi  at 
maximum 

Pi  Pi  Pz 

+ 

+ 

+ 

0 

0 

0 

-9 

-9 

-12 

0 

0 

0 

— 

+ 

- 

+ 

+ 

- 

0 

0 

1 

27 

27 

36 

0 

0 

1 

+ 

+ 

+ 

— 

— 

0 

0 

1 

27 

27 

36 

0 

0 

1 

+ 

+ 

+ 

0 

0 

- 

0 

0 

1 

27 

27 

36 

0 

0 

1 

+ 

+ 

+ 

+ 

- 

+ 

0 

1 

0 

-21 

-9 

-12 

0 

0 

0 

- 

+ 

- 

— 

- 

0 

1 

0 

-21 

-9 

-12 

0 

0 

0 

- 

+ 

- 

0 

— 

0 

0 

1 

0 

-21 

-9 

-12 

0 

0 

0 

- 

+ 

- 

— 

+ 

+ 

1 

0 

0 

27 

-9 

36 

0 

0 

1 

+ 

+ 

+ 

- 

- 

1 

0 

0 

27 

-9 

36 

0 

0 

1 

+ 

+ 

+ 

— 

0 

0 

1 

0 

0 

27 

-9 

36 

0 

0 

1 

+ 

+ 

+ 

+ 

0 

a 

1  -  a 

27  -  48a 

27  - 

36a 

36  - 

4Sa 

0 

+ 

- 

0 

a 

1  -  a 

27  -  48a 

27  - 

36a 

36  - 

48a 

0 

+ 

0 

0 

a 

1  -  a 

27  -  48a 

27  - 

36a 

36  - 

48a 

0 

+ 

+ 

a 

0 

1  —  a 

27 

27  - 

36a 

36 

0 

0 

1 

+ 

+ 

+ 

- 

a 

0 

1  -  a 

27 

27  - 

36a 

36 

0 

0 

1 

+ 

+ 

+ 

0 

a 

0 

1  —  a 

27 

27  - 

36a 

36 

0 

0 

1 

+ 

+ 

+ 

+ 

a 

1  -  a 

0 

48a: 

-  21 

-9 

48a 

-  12 

0 

+ 

- 

a 

1  -  a 

0 

48a 

-  21 

-9 

;  48a 

-  12 

0 

+ 

0 

a 

1  -  a 

0 

48a 

-  21 

-9 

48a 

-  12 

0 

+ 

- 

- 

- 

a 

0 

1  —  a  —  0 

27 

-  48/3 

27  -  36a 

-  36/3  36  - 

48/3 

0 

+ 

0 

+ 

+ 

a 

0 

0 

36a 

-  9 

-9 

48a 

-  12 

0 

Pi 

> 

P3 

+ 

0 

+ 

0 

a 

0 

-12a 

-  9 

-9 

-12 

0 

0 

0 

— 

+ 

— 

+ 

+ 

0 

0 

0 

a 

36a 

-  9 

36a  -  9 

48a 

-  12 

- 

- 

-  if 

a  <  i 

0 

0 

0 

- 

+ 

- 

+ 

+ 

+  if 

a>  i 

0 

0 

1 

+ 

+ 

+ 

0 

0 

0  if  a  =  i 

a 

b 

c 

+ 

+ 

0 
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Blue’s  basic  solutions  are  the  vertices  of  the  convex  set  X(A).  To 
determine  the  vertices  we  h^ve 

2  3ri  3z2  .  r.  i  >  2 

x  _  — - - - >0  or  *i  4-  Xi  <  f. 

4 

From  the  inequalities,  we  obtain 

*2  ^  nV)  —  T£- 

These  three  inequalities  and  the  equality,  together  with  Xi  >  0,  x2  >  0, 
determine  a  convex  set  having  five  vertices  whose  coordinates  are 

(0)  A)  ii i  Trr)>  inri  0)  "I)j  (tsi  A>  ^)» 

(rs>  0,  xiff,  xl),  (0)  0,  i,  i)- 
These  five  vertices  are  the  basic  solutions  for  Blue. 


6  GAMES  WITH  INFINITE 
NUMBER  OF  STRATEGIES 


1.  INTRODUCTION 

In  a  finite  game,  each  player  selects  a  strategy  from  a  finite  or  discrete 
set  of  strategies.  The  number  of  such  strategies  may  be  large,  as  in  chess, 
but  finite.  A  natural  generalization  is  to  consider  games  in  which  a  player 
chooses  a  strategy  from  an  infinite  set  of  strategies.  Such  a  game  is  called 
a  continuous  or  infinite  game.  The  name  derives  from  a  closed  interval’s 
being  called  a  continuum.  There  is  no  loss  of  generality  if  we  assume  that 
the  strategies  are  represented  by  points  on  the  closed  interval  [0,  1].  For, 
if  S  is  the  set  of  strategies,  then  by  relabeling  the  elements  of  S,  we  can  get 
a  game  in  which  the  selection  of  a  strategy  is  made  from  the  closed  interval 
[°,  i]. 

There  are  several  reasons  for  developing  a  theory  of  continuous  games. 
Many  military  and  economic  problems,  when  viewed  as  games,  involve  an 
infinite  number  of  strategies.  For  example,  a  military  budget  can  be 
thought  of  as  being  divisible  in  an  infinite  number  of  ways  between  of¬ 
fense  and  defense.  In  economics  a  commodity  may  have  an  infinite  number 
of  price  possibilities.  Further,  computations  using  a  continuous  variable 
are  sometimes  easier  than  those  using  a  variable  which  takes  on  a  finite, 
but  large,  number  of  values. 

2.  DESCRIPTION  OF  CONTINUOUS  GAMES 

In  its  simplest  form,  a  continuous  game  may  be  described  as  follows: 
Blue  chooses  a  strategy  x,  where  0  <  x  <  1,  and  simultaneously  Red 
chooses  a  strategy  y,  where  0  <  y  <  1.  The  choices,  x  and  y,  determine 
a  play  of  the  game,  whose  outcome  is  measured  by  a  payoff  M(x,  y)  to 
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Blue.  Since  the  game  is  assumed  to  be  zero-sum,  the  payoff  to  Red  is 
—M(x,  y). 

Let  us  assume  that  the  payoff  function  Mix,  y)  has  a  minimum  with 
respect  to  y  for  each  x  and  a  maximum  with  respect  to  x  for  each  y.  Sup¬ 
pose  Blue  chooses  some  strategy  x;  then,  depending  on  Red’s  choice  of 
his  strategy,  Blue  will  receive  a  payoff  of  at  least 

(6.1)  min  M{x,  y)  =  f(x). 

V 

Now  Blue  can  choose  x  to  make  f(x)  as  large  as  possible.  Hence  there  is 
a  way  for  Blue  to  play  or  there  exists  a  strategy  for  Blue,  so  that  he 
receives  at  least 

max  min  M(x,  y). 

x  y 

Similarly,  there  exists  a  strategy  for  Red  such  that  Blue  receives  at  most 

min  max  M(x,  y). 

y  x 

It  is  easily  shown  that  these  two  quantities  satisfy  the  inequality 

(6.2)  max  min  M(x,  y)  <  min  max  M(x,  y). 

x  y  y  x 

If  in  (6.2)  the  equality  holds,  or  that 

(6.3)  max  min  M (x,  y)  —  min  max  M(x,  y)  =  M (x0,  y0), 

x  y  y  x 

then  M  ( x ,  y)  has  a  saddle-point  at  xa,  yo-  Since  the  strategies  x0  and  y0 
satisfy  the  conditions 

(6.4)  M(x o,  y)  >  M(x 0,  y0)  for  all  y, 

M(x,  y0)  <  M(x o,  yo)  for  all  x, 

we  call  x0  an  optimal  strategy  for  Blue  and  y0  an  optimal  strategy  for  Red. 
Thus,  if  a  continuous  game  has  a  saddle-point,  then  the  saddle-point  yields 
a  pair  of  optimal  strategies,  one  for  each  player. 

3.  MIXED  STRATEGY— DISTRIBUTION  FUNCTION 

Suppose  the  game  is  such  that 

(6.5)  max  min  M(x,  y)  <  min  max  M(x,  y) ; 

xy  y  x 

then  the  game  does  not  have  a  saddle-point.  It  will  be  recalled  that  if  a 
finite  game  does  not  have  a  saddle-point  it  is  necessary  to  introduce  mixed 
strategies,  which  are  probability  distribution  functions  over  the  finite  set 
of  strategies.  In  order  to  solve  infinite  games  without  saddle-points  it  will 
be  necessary  to  introduce  probability  distribution  functions  over  the  in¬ 
finite  set  of  strategies.  However,  it  is  no  longer  possible  to  think  of  a 
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mixed  strategy  as  a  rule  which  ascribes  a  probability  to  each  number  in 
the  closed  interval  [0,  1].  We  need  a  different  definition  of  probability 
for  infinite  number  of  strategies.  Of  course,  any  definition  we  formulate  for 
the  infinite  game  must  also  apply  to  the  finite  game. 

A  mixed  strategy  in  an  infinite  game  is  a  random  process  for  choosing  a 
number  x  from  the  interval  [0,  1].  We  can  also  think  of  a  mixed  strategy 
as  a  random  process  for  choosing  a  number  no  larger  than  x  from  the  in¬ 
terval  [0,  1].  If  £  is  a  random  variable  such  that  0  <  £  <  1,  then  a  mixed 
strategy  is  a  function  F  such  that,  for  all  x  in  [0,  1], 

(6.6)  F{x)  =  pr  {£  <  x}. 

That  is,  F(x)  is  the  probability  that  the  number  chosen  by  the  random 
process  F  is  at  most  x.  For  mathematical  convenience,  we  shall  modify 
the  definition  slightly  for  the  case  where  x  =  0.  We  shall  define  F( 0)  =  0. 
Thus  F(0)  is  the  probability  that  the  number  chosen  will  be  actually  less 
than  zero,  not  at  most  zero.  Suppose  0  <  x\  <  x2  <  1;  then  we  see  that 

F{x 2)  —  F{x j)  =  pr  (zi  <  £  <  x2) 
and  F(x2)  —  F( 0)  =  pr  (0  <  £  <  x2}. 

The  function  F  is  called  a  cumulative  distribution  function,  or  simply  a 
distribution  function. 

From  our  definition  of  F  in  (6.6)  as  a  probability  function  it  follows 
that  F  has  the  following  properties: 

(i)  F(x)  >  0  for  all  0  <  x  <  1. 

(ii)  F(0)  =  0,  F(l)  =  1. 

(iii)  F  is  a  nondecreasing  function  of  x. 

F(x i)  <  F(xf)  whenever  X\  <  x2. 

(iv)  F  is  right-hand  continuous  in  the  open  interval  (0,  1),  or 

lim  F(x  +  e)  =  F(x). 

e — >0 

The  last  result  follows  from  the  fact  that 

F(x  +  e)  —  F{x)  —  pr  {x  <  £  <  x  +  «}. 

Now  for  any  given  £  it  is  possible  to  find  an  e  >  0  which  is  sufficiently 
small  so  that  the  above  inequality  cannot  hold.  Hence  the  probability 
that  £  satisfies  this  inequality  approaches  zero  as  e  approaches  zero. 

Any  function  F  which  satisfies  the  above  four  conditions  is  a  distribu¬ 
tion  function.  It  is  obvious  that  for  a  finite  set  of  strategies  we  can  get  a 
mixed  strategy  from  the  distribution  function  over  the  finite  set  of 
strategies. 

We  can  construct  new  distribution  functions  from  given  ones  as  follows: 
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Let  Fi,  Fit .  .  .  ,  Fn  be  n  distribution  functions;  then  by  forming  any  convex 
linear  combination  of  these  functions, 

F(x )  —  aiFi(x)  +  .  .  .  +  anFn{x), 

where  ca  >  0  and  2  a,  =  1,  we  get  another  distribution  function. 

Although  a  distribution  function  must  be  right-hand  continuous,  it 
may  be  discontinuous.  A  common  class  of  distribution  functions  which 
are  discontinuous  are  the  step-functions.  Define  the  following  function : 

Ia{x )  =0  if  x  <  a, 

=  1  if  x  >  a. 

It  is  obvious  that  if  0  <  a  <  1,  then  Ia(x)  is  a  distribution  function.  It 
has  a  discontinuity  at  x  —  a,  with  a  jump  of  1  at  a.  We  call  Ia{x)  a  step- 
function  with  one  step  at  a. 

The  distribution  function 

F(x )  —  ajxi(x)  +  ctzlxfx)  +  .  .  .  +  anIx„(x), 

where  a{  >  0,  2  a,  =  1,  0  <  x,  <  xi+1  <  1,  is  a  step-function  with  n  steps. 

To  assist  us  in  computing  a  distribution  function  F(x),  we  make  use  of 
the  following  definition  of  choosing  a  point  at  random  on  a  line :  If  a  point 
is  chosen  at  random  on  a  line  interval  of  length  l,  the  probability  that  it  is 
in  a  given  subinterval  of  length  X  is  \/l. 


4.  EXPECTATION— STIELTJES  INTEGRAL 

Having  defined  a  mixed  strategy  as  a  probability  distribution  function, 
we  now  need  to  define  expectation.  Suppose  Blue  receives  a  payoff  P(x) 
if  he  chooses  strategy  x.  Suppose  further  that  Blue  chooses  his  strategy  x 
by  means  of  a  random  device  which  picks  x  according  to  the  probability 
distribution  function  F(x).  We  wish  to  determine  the  expectation  of  P(x) 
with  respect  to  the  probability  distribution  function  F(x). 

It  is  clear  that  we  cannot  say  this  expectation  is  simply  the  sum  of  all 
products  P{x)F{x).  On  the  other  hand  we  may  approximate  the  expecta¬ 
tion  by  dividing  the  interval  [0,  1]  into  a  large  number  of  subintervals, 
computing  the  expectation  over  each  subinterval,  and  adding  these  expec¬ 
tations  for  all  the  subintervals.  Suppose  we  divide  the  interval  [0,  1]  by 
the  points 

0  =  x0  <  Xi  <  Xz  <  .  .  .  <  xn  =  1. 

Let  A  =  max  (xt  —  x,-_i) 

l<i<n 

and  let  X;-i  <  Z{  <  for  1  <  i  <  n. 


GAMES  WITH  INFINITE  NUMBER  OF  STRATEGIES  101 


Then  we  can  approximate  the  expectation  of  P(x)  by 
2  P(Zi)[F(xi)  -  F(x «)]. 

»  =  i 

lim  2  PiZiWixi)  -  F(*w)] 

n—>* o  i=l 
A— >0 

exists  and  is  independent  of  the  choice  of  Zi,  then  this  limit  is  called  the 
Stieltjes  integral  of  P  with  respect  to  F  from  0  to  1  and  is  denoted  by 

(6.8)  //  P{x)  dF{x). 

We  say  that  if  P(x)  is  Blue’s  payoff  corresponding  to  a  strategy  choice  x, 
then  if  he  chooses  x  according  to  the  distribution  function  F,  his  expectation 
will  be 

E(P(x))  =  £  P(x)  dF(x). 

The  Stieltjes  integral  does  not  always  exist.  In  particular,  if  P  and  F 
have  a  common  point  of  discontinuity,  then  the  integral  does  not  exist. 
For  example,  let 

P{x )  =  F(x)  =  0  for  0  <  x  <  J, 

P(x)  =  F(x)  =  1  for  §  <  x  <  1. 

Here  exactly  one  of  the  differences  F(xi)  —  F(xi- 1)  is  different  from  0,  call 
it  F(xk)  —  F(x k-i),  which  is  equal  to  1.  Then  P(Zk)  =  0  or  1  depending 
on  the  choice  Zk.  Thus  the  limit  (6.7)  is  not  independent  of  the  choice  of 
the  Zi  s  and  hence  (6.8)  does  not  exist. 

5.  STIELTJES  INTEGRAL  FOR  CONTINUOUS  FUNCTION 

We  shall  now  show  that  if  P  is  continuous  in  the  interval  [0,  1],  then 

/„  P(P)  dF(x) 

exists. 

Suppose  the  interval  [0,  1]  is  divided  into  n  intervals  by  the  points 
0  =  x0  <  X\  <  Xi  <  .  .  .  <  xn  =  1.  Let  the  interval  \_Xk~i,  x^\  be  denoted 
by  H.  Define  the  following  quantities: 

Mh  =  max  P(x), 

H 

mk  =  min  P{x), 

H 

Sn  =  S  Mi[F(xi)  -  F( *w)], 

»=i 

sn  =  2  m,[F(x,)  —  F(xi_i)]. 


If 

(6.7) 
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It  is  evident  that 

sn  <  Sn  for  all  n. 

The  sequence  {S„}  decreases  with  increasing  n.  For,  let  the  interval 
[0,  1]  be  divided  into  n  parts.  To  obtain  a  division  of  the  interval  [0,  1] 


into  7i+l  parts  let  us  introduce  one  more  division  point,  x,  say  it  falls 
between  xt- and  Xi_i,  i.e.,  Xj_i  <  x  <  x,-. 

Define 

"Mi  =  max  P(x)  <  Mi, 

X  <X  <Xi 

Mi  =  max  P(x)  <  M 

Xi- 1  <x  <x 

Then 

Sn+i  =  S  Mi[F(xd  —  F(x<_ 1)], 

i  =  1 

-Sn  -  s„+1  =  Mi[F{xi)  -  F(xi-i)]  -  Mi[F(x)  -  F(x^)] 

-  miFfa)  -  F(x)] 

>  Mi[F(xi)  -  F(Xi _i)  -  F(x)  +  F(xi-i)  -  F(Xi)  +  F(x)] 

=  Mi  -  0  =  0. 

Thus 

Sn  ^  Sn+l- 

The  sequence  {s„}  increases  with  increasing  n.  For,  let  us  make  a 
similar  division  of  the  interval  [0,  1]  as  in  the  previous  case  and  define 

Wi  =  min  P(x)  >  mi, 

X<X<Xi 

Mi  =  min  P(x)  >  m,. 

Xt-i  <x  <T 

Then 

n  -f*  1 

Sn+l  =  2  77l,[P(Xi)  —  P(Xi_l)], 

i  =  1 

Sn+i  —  s„  =  nii[F(xi )  -  P(x)]  +  Mi[F(x )  -  P(x<_ i)] 

-  77li[P(Xi)  —  P(Xi_j)] 

>  7n,[P(xi)  —  F(x)  +  F(x)  —  F(xi-i)  —  F(xt)  +  F(xi_i)]  =  0. 

Therefore 

Sn+1  P  8n* 
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Combining  the  inequalities  on  sn  and  Sn  we  have 
Sn  ^  ^  Sn+1  ^ 

or  sn  <  Si  for  all  n. 

Therefore  the  increasing  sequence  {s»}  is  bounded.  Hence  it  has  a  limit. 
Let  this  limit  be  defined  by 

s  -  lim  s„ 

n— 

A— >0 

where  A  is  the  length  of  the  largest  of  the  n  intervals. 

We  also  have 

S ji -j-j  ^  "P  Sn  ^  S\. 

The  decreasing  sequence  {<S„}  has  a  greatest  lower  bound 

S  =  lim  Sn. 

n-~*& 

A— >0 

Since  P  is  continuous  over  the  closed  interval  [0,  1],  it  is  uniformly 
continuous.  Hence  corresponding  to  any  positive  number  t,  there  exists  a 
positive  number  5  such  that  if  \zi  —  zt\  <  S  then 

|P(zi)  -  P(z2)|  <  «■ 

This  means  that  if  A  <  5  then  |Af*  —  mk \  <  e  for  k  =  1,2, ...  ,n.  Thus 
0  <  Sn  —  S„  =  2  (Mi  —  ml)\F(Xi)  —  F(xi-i)] 

i=i 

<  *  2  [F(*<)  -  F(iw)]  =  t. 

t=i 

We  have  then 

0  <  S„  —  s„  =  (Sn  —  S)  +  (S  —  s)  +  (s  —  sn)  <  e. 

Each  of  the  three  terms  in  the  parentheses  is  >  0.  Since  e  can  be  made 
arbitrarily  small,  it  follows  that  each  of  the  three  terms  must  approach  0 
in  the  limit,  and 

S  =  s. 

Now  for  any  choice  of  the  z.’s  we  have 

Sn  <  2  P(Zi)[F(x,)  -  F(x*_0]  <  Sn. 

Since 

lim  $„  =  lim  Sn 

n — >  co  n~~* 

it  follows  that 

lim  2  P(zi)[F(xd  -  FM]  =  S 

n— >» 

A— *0 

exists  and  is  independent  of  the  choice  of  the  z/s. 
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6.  STIELTJES  INTEGRAL  AND  RIEMANN  INTEGRAL 

In  some  cases  a  Stieltjes  integral  may  be  evaluated  like  the  ordinary, 
or  Riemann,  integral.  In  particular,  if 

P(x)  dF(x) 

exists  and  if  F  has  a  derivative  F'  at  every  point  in  [0,  1]  then  we  have 
£  P(x)  dF{x)  =  JT1  P(x)F'{x)  dx. 

Hence,  in  this  case,  we  can  evaluate  the  Stieltjes  integral,  if  it  exists,  by 
evaluating  a  well-known  Riemann  integral,  if  it  exists. 

Let  the  interval  [0,  1]  be  divided  into  n  parts  by  the  points 
0  =  Xo  <  Xi  <  x2  <  .  .  .  <  xn  =*1.  Let  A  be  the  length  of  the  largest  in¬ 
terval.  Since  F  has  a  derivative  everywhere,  there  exist  yi  where  £i_i  < 
Vi  <  Xi  (i  =  1,  2, .  .  . ,  n)  such  that 

(6.9)  F(xt)  -  F(Xi_i)  =  F'(yi)(x i  -  Im). 

Now  if  the  Riemann  integral  P(x)F'(x)  dx  exists  it  is  defined  by 

(6.10)  lim  S  P{yt)F’(yt){xi  —  Xi~ i)  =  f1  P(x)F'(x)  dx 

n— i=l 

A— >0 

independent  of  the  choice  of  the  y/s.  Substituting  (6.9)  into  (6.10)  we  get 

(6.11)  lim  S  P(yi)[F(Xi)  -  FM]  =  P  P(x)F'(x)  dx 

n— i  =  l 

A— >0 

where  the  left  side  of  (6.11)  is  independent  of  the  choice  of  the  y’s.  But 
the  left  side  of  (6.11)  is  the  definition  of  the  Stieltjes  integral.  Hence 
we  have 

£  P(x)  dF(x)  =  JT1  P(x)F’{x)  dx. 

7.  STIELTJES  INTEGRAL  WITH  RESPECT  TO  A  STEP-FUNCTION 

If  F(x )  is  a  step-function,  or 

F(x )  =  Ia(x), 

and  if  P(x)  is  continuous  at  a,  then 

/0'  P{X)  dla(x )  =  P(a). 
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The  result  above  follows  from  the  continuity  of  P(x)  at  x  —  a.  Let 

the  points  of  division  of  [0,  1]  be  0  =  x0  <  xi  <  x%  <  ...  <  xn  =  1.  Let 

Xj-i  <  a  <  Xj.  Then 

F{x,)  -  F(xj _0  =  1  for  i  =  j, 

F(xt)  —  F(Xi- 1)  =  0  for  i  j. 

Therefore 

2  P(*i)[F( xt)  -  F(xi-i)]  =  P{z,) 

i  =  l 

where  <  zj  <  Xj.  But  from  the  continuity  of  P{z)  at  z  =  a,  we  have 
that 

lim  P{zi)  =  P(a). 

n— 

A->0 

Therefore 

(6  12)  1™  <ti  ~  F(x^i)]  =  P(a) 

A->0 

independent  of  the  choice  of  the  z.’s.  The  left  side  of  (6.12)  is  the  definition 
of  a  Stieltjes  integral,  hence 

/QX  P(X)  dljx)  =  P(a). 

Using  a  similar  argument  we  can  show  that  if 

F(x)  =  ajai(x)  +  a2Im(x )  +  .  .  .  +  anIa.(F) 

n 

where  a,-  >0,  2  a<  =  1,  0  <  <  1,  and  if  P(x)  is  continuous  at  the 

i=i 

n  values,  ai,  a2,  .  .  .  ,  <*„,  then 

P(x)  dF(x)  -  fliP(ai)  -f  a2P{a2)  +  .  .  .  +  anP(an). 

8.  SOME  PROPERTIES  OF  STIELTJES  INTEGRAL 

If  the  integrals  involved  exist,  it  can  be  readily  shown  that  they  have 
the  following  properties: 

(i)  £  P(x)  dF(x)  =  f°  P{x)  dF{ x)  +  \l  P(x)  dF{x). 

(ii)  /oX  [P(x)  +  Q(x)]  dF(x)  =  fol  P{x)  dF(x)  +  /oX  Q(x)  dF{x). 

(hi)  ^  P(z)  d[fcF(x)]  =  k  £  P(x)  dF(x). 

(iv)  |ol  P(z)  d[P(z)  +  G(x)]  -  l'  P(x)  dF(x )  +  fQ'  P(x)  dG(x). 
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(v)  JT1  dF(x)  =  F(  1)  -  F( 0)  =  1. 

(vi)  /q:  P(x)  dF(x )  =  P(1)F(1)  -  P(0)F(0)  -  £  F(x)  dP(x). 

(vii)  I  £  P(*)  dP(*)|  <  JT1  |P(*)|  rfP(rc). 

(viii)  If  P(a;)  <  Q(x)  in  [0,  1]  then 

£  P(x)  dF(x)  <  JT1  Q(*)  <iP(x). 
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1.  OPTIMAL  MIXED  STRATEGY 

Suppose  that  the  payoff  function  is  M(x,  y ),  and  suppose  Blue  chooses 
his  strategy  x  from  [0,  1]  using  the  distribution  function  F(x).  Then  for 
any  strategy  y  chosen  by  Red,  the  expectation  of  Blue,  if  it  exists,  is 
given  by 

E(F,  y)  =  £  M(x,  y)  dF(x). 

Now  suppose  that  Red  chooses  y  by  means  of  the  distribution  function 
G{y);  then  the  expectation  of  Blue,  if  it  exists,  will  be 

E(F,  G)  =  ft  jQl  M(x,  y)  dF(x)  dG(y). 

E{F,  (?)  is  Blue’s  expectation  if  he  plays  a  strategy  determined  by  the 
distribution  function  F(x)  and  Red  plays  a  strategy  determined  by  the 
distribution  function  G(y). 

Suppose  the  following  two  expressions  exist: 

max  min  E(F,  G)  =  V\, 

F  O 

min  max  E(F,  G)  =  Vi. 
a  f 

Then  there  exists  a  distribution  function  F*  such  that  Blue  can  receive 
(in  expectation)  at  least  t>i.  There  also  exists  a  distribution  function  G* 
such  that  Blue  gets  at  most  v2. 

In  general,  v\  <  v2.  However,  if  Vi  =  v2,  or  if 

(7.1)  max  min  E(F,  (?)  =  min  max  E(F,  (?)  =  v, 

f  o  OF 
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then  we  call  v  the  value  of  the  game  to  Blue.  Further,  there  exists  an  F* 
such  that  Blue  receives  at  least  v  regardless  of  Red’s  mixed  strategy,  i.e., 

min  E(F*,  (?)  =  v. 
a 

Therefore 

(7.2)  E(F*,  (?)  >  v  for  all  (?. 

Similarly,  there  exists  a  (?*  such  that 

(7.3)  E(F,  (?*)  <  v  for  all  F. 

Thus  F*,  G*  are  called  optimal  mixed  strategies  for  Blue  and  Red,  respec¬ 
tively.  The  pair  F*,  (?*  is  also  called  a  solution  of  the  game.  We  also  have 

(7.4)  E(F*,  (?*)  =  v. 

2.  EXISTENCE  OF  OPTIMAL  STRATEGIES 

From  the  minimax  theorem  on  finite  games,  it  follows  that  every  game 
with  a  finite  number  of  strategies  has  a  solution.  However,  every  infinite 
game  does  not  have  a  solution.  There  are  examples  of  infinite  games  which 
do  not  have  solutions.  However,  if  the  payoff  function  M (x,  y)  is  contin¬ 
uous  in  each  of  the  two  strategic  variables,  then  it  can  be  proven  that  the 
game  always  has  a  solution.  Thus  if  Mix,  y )  is  continuous  in  x  and  y, 
there  exists  a  pair  of  distribution  functions  F*,  (?*,  one  for  each  player 
such  that 

max  f '  f 1  M{ x,  y)  dF(x)  dG*{y )  =  min  [  [  M{x,  y)  dF*{x)  dG{y). 

F  Jo  JO  Q  j  j 

We  can  also  state  that  F*,  G*  is  a  solution  and  v  is  the  value  of  the  game, 
if  and  only  if 

(7.5)  max  f 1  M(x,  y)  dG*{y )  =  min  f  M(x,  y)  dF*(x)  =  v. 

x  J®  y  J® 

To  prove  (7.5),  let  us  first  show  that  if  P(x)  is  continuous,  then 

max  f1  P(x)  dF(x)  =  max  P{x). 
f  Jo  x 

Since  P(x)  is  continuous  on  the  interval  [0,  1],  it  has  a  maximum.  Let 
the  maximum  occur  at  a,  or 

max  P(x)  =  P(a). 

X 

Hence 

P(a )  >  P(x)  for  all  x  in  [0,  1]. 

For  any  distribution  function  F,  it  follows  that 

P(a )  dF{x)  >  /QX  P(x)  dF(x). 
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Hence  for  any  F 

P(a)  >  JT1  P(x)  (IF (x) . 

Let  us  compute  the  least  upper  bound  of  the  numbers 


J  P(x)  dF(x). 

That  is,  we  wish 

sup  (  P(x)  dF(x). 

F  J 


Now 

sup  J  P(x)  dF(x )  >  jQ  P(x)  dla(x)  =  P(a). 

But,  for  any  F 

P(a)  >  jol  P(x)  dF(x). 

In  particular, 

P(a)  >  sup  f1  P(x)  dF(x ). 

F  J° 

Hence 

sup  f1  P(x)  dF(x)  —  P(a)  =  [  P{x)  dJa(x). 

Thus  the  least  upper  bound,  with  respect  to  F,  of  the  numbers  jP(x)  dF(x) 
is  actually  assumed  at  F  —  Therefore 

max  f1  P(x)  dF{x)  -  P (a)  =  max  P(x). 

F  JO  x 

Similarly,  we  can  show 


min  f 1  P(y)  dG{y )  =  min  P(y). 

g  Jo  y 

Now  (7.5)  follows  from  the  fact  that 

max  Jo'  j*  M(x,  y)  dF{x)  dG*(y)  =  J*  [max  j*  M(x,  y)  dF(x)J  dG*(y ) 

=  f 1  max  M{x,  y)  dG*(y) 

X 

=  max  f 1  M(x,  y)  dG*{y) 
x  JQ 

and  min  P  P  M(x,  y)  dG(y)  dF*{x)  =  min  P  M(x,  y)  dF*{x). 

G  JQ  JQ  y  J° 


3.  PROPERTIES  OF  OPTIMAL  STRATEGIES 

Let  H*(x)  be  Blue’s  expectation  if  he  uses  a  pure  strategy  x  and  Red 
uses  an  optimal  strategy  G*(y),  i.e., 
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H*(x)  =  fol  M(x,  y )  dG*(y). 

Similarly,  let  K*(y )  be  Blue’s  expectation  if  Red  uses  a  pure  strategy  y 
and  Blue  uses  an  optimal  strategy  F*,  i.e., 

K*(y)  =  JT1  M(x,  y )  dF*(x). 

The  following  properties  are  readily  proven: 

(i)  H*( x)  <  v  <  K*(y)  for  all  x  and  y  in  [0,  1].  Each  pure  strategy 
when  used  against  an  opponent’s  optimal  mixed  strategy  cannot 
yield  a  higher  expectation  than  an  optimal  strategy. 

(ii)  max  H*(x)  —  min  K*(y)  =  v.  Each  player  has  at  least  one  pure 

x  V 

strategy  which,  if  used  against  an  opponent’s  optimal  strategy, 
yields  the  value  of  the  game. 

(iii)  If  H*(x0)  <  v  then  pr  {£  =  x0}  =  0.  A  player’s  optimal  mixed 
strategy  contains  no  strategy  which  yields  less  than  the  value  of 
the  game  when  that  strategy  is  used  against  the  opponent’s 
optimal  strategy.  If  K*(y0 )  >  v  then  pr  {£  =  yQ}  =  0. 

(iv)  If  pr  {£  =  x0}  >  0  then  H*(xa)  =  v.  Each  pure  strategy  in  the 
mixed  strategy  must  yield  the  value  of  the  game  when  used  against 
the  opponent’s  optimal  mixed  strategy. 

Example.  Suppose  the  payoff  is  given  by 

M(x,  y)  =  (x  -  y)\ 

Let  us  verify  that  the  optimal  strategy  for  Blue  is  to  play  x  —  0  and  x  =  1 
at  random  with  equal  probability  and  the  optimal  strategy  for  Red  is  to 
play  y  —  We  wish  to  verify  that 

F*(x)  -  \I0(x)  +  hh{x),  G*(y)  =  Im(y). 

Evaluating,  we  have 

max  f  M(x,  y)  dG*(y)  =  max  M(x,  §) 

X  X 

=  max  ( x  —  |)2  =  j. 

X 

min  f 1  M(x,  y)  dF*(x)  =  min  [|M(0,  y)  +  W(L  y )] 

y  y 

=  min  [%y2  +  |(1  -  y)2]  =  J. 
y 

Since 

max  JQl  Mix,  y)  dG*(y)  =  min  M{x,  y)  dF*(x)  = 
it  follows  that  F*,  G*  as  defined  above  are  optimal  strategies  and  v  —  |. 
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We  also  have,  for  this  game,  that 

H*(x)  =  f'  M(x,  y )  dG*{y)  =  (*  -  i)2 

K*(y)  =  JT1  M(x,  y)  dF*{x)  =  \[y2  +  (1  -  y)2]. 


Further 


ma  xH*(x)  =  H*l  0)  =  H*(  1)  = 
min  K*(y)  =  K*lh)  =  i 


#*(x)  <  |  for  all  x  ^  0,  1 

fc*(2/)  >  i  for  a11  V  **  i 

If  £  is  the  random  variable  selected  by  the  optimal  mixture,  note  that 
for  Blue:  pr  {£  0,  1}  =  0, 

for  Red:  Pr  {£  ^  5}  —  0. 

Exam-pie.  We  now  give  an  example  of  a  game  which  has  a  continuous 
solution.  Suppose  the  payoff  is  given  by 

Mix,  y)  =  \y  -  s I  (1  -  I y  -  *D- 

Let  us  verify  that  a  solution  of  this  game  is  F":(x)  =  x  and  G*(y )  =  y. 
We  have 

i/*(x)  =  £  Mix,  y)  dG*(y)  =  \y  —  x|  (1  —  1  y-  *|)  dy 

=  fx  (x  -  x2  +  2 xy  -  y  -  y 2)  dy 
Jv= 0 

+  fy=1  (y  -  V2 -i- 2xy  -  x  -  x2)  dy  = 
Jy—x 

K*ly)  =  f('  M lx,  y)  dF*(x) 

=  jQl  I y  —  x|  (1  —  | y-  x|)  dx  =  h 

gince 

H*(x)  s  |=  K*(j/)  for  all  x  and  y 

it  follows  that  F*{x)  =  x,  G*(y)  =  y  are  optimal  strategies  and  v  =  1- 


4.  DELAYED  FIRING 

An  example  of  a  game  with  a  discontinuous  payoff  which  has  a  solution 
is  the  operational  problem  of  scheduling  the  firing  of  a  missile  requiring 
an  exposure  time.  Suppose  that  Blue  plans  to  fire  one  missile  before  T 
hours  have  expired.  However,  in  order  to  fire  the  missile,  Blue  must  ex¬ 
pose  it  for  e  hours,  where  e  <  T,  during  which  time  the  missile  is  vulnerable 
to  attack  by  Red.  Let  us  assume  that  Red,  not  knowing  Blue’s  decision, 
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has  only  one  opportunity  to  attack.  When  is  the  optimal  time  for  Blue  to 
expose  his  missile?  When  is  the  optimal  time  for  Red  to  attack? 

A  strategy  for  Blue  is  a  choice  of  time  X  for  him  to  begin  exposing  the 
missile  for  a  time  e,  where  0  <  X  <  T  —  e.  Since  Red  does  not  know 
Blue’s  choice,  a  strategy  for  Red  is  a  choice  of  time  Y  for  Red  to  attack 
Blue,  where  0  <  Y  <  T.  Blue  will  fire  his  missile  at  time  X  +  e  if  he 
has  not  been  attacked  by  Red  during  the  time  of  missile  exposure.  That 
is,  if  Red  attacks  either  before  or  after  missile  exposure,  Blue  will  be  able 
to  fire  his  missile. 

Let  the  payoff  to  Blue  be  1  if  he  fires  the  missile  (i.e.,  Blue’s  missile 
was  not  attacked  during  exposure)  and  0  otherwise.  Then  in  terms  of  the 
strategies  of  the  players  the  payoff  is  described  by  the  following 
discontinuous  payoff  function: 


M(X,  Y)  = 


1  HY<X<T-e  or  X  +  e<Y<T 
0  otherwise. 


Let  t  =  e/T,  x  =  X/T,  and  y  =  Y/T;  then  Blue’s  strategies  now  range 
over  the  interval  [0,  1  —  <]  and  Red’s  strategies  now  range  over  the  interval 
[0,  1].  The  payoff  function  becomes 


(7.6) 


M(x,  y )  = 


1  \i  y  <  x  <  \  —  t  or  x  +  t<y<\ 
0  otherwise. 


Let  us  assume  that  t  <  % ;  then  it  is  readily  shown  that 
0  =  sup  inf  M(x,  y)  <  inf  sup  M(x,y)  =  1. 

x  y  y  x 

Hence  if  a  solution  exists  it  must  be  in  terms  of  mixed  strategies.  Let  F(x) 
and  G (y)  be  mixed  strategies  of  Blue  and  Red,  respectively.  The  expected 
payoff  to  Blue  is 

E(F,  G )  =  /01_‘  JT1  M(x,  y )  dG(y)  dF(x ) 

(7.7)  =  /01_t  /;  dG(y)  dF{x)  +  dG(y)  dF(x) 

=  /01_<  [<?(*)  +  1  -  G(x  +  <)]  dF(x). 


Using  (7.7),  we  have 

inf  sup  E(F,  G)  =  inf  sup  [G(x)  +  1  —  G{x  +  <)] 

GF  G  0<i<I-J 

=  1  —  sup  inf  [G(x  +  t)  —  G(x)]. 

G  0<x<l-t 


a(G)  =  inf  [G(x  +  t)  -  <?(*)]. 

0<i<1-( 


Now  let 
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Then,  for  all  x,  we  have 

a(G)  <  G(x  +  t)  -  G(x) 

or  G(x  +  t)>  G(x)  +  a(G)  for  all  0  <  x  <  1  -  t. 

Setting  x  =  0,  t,  2t,  .  .  .  ,  (n  —  1)£,  we  obtain 
G(t)  >  a(G) 

G{2t)  >  G(t)  +  a(G)  >  2 a(G) 

G(3()  >  3 a{G) 


G(nt)  >  na(G), 

where  nt  <  1  <  (n  +  1)1,  i.e.,  n  =  [1  /(]  or  the  largest  integer  contained 
in  l/i. 

Therefore,  we  have 

na(G)  <  G(nt)  <  G(  1)  =  1 

and  a(G)  <  -  for  all  G. 

n 

It  follows  that 

(7.8)  sup  inf  [G(x  +  t)  —  G(a:)]  <  ~ 

G  0<x<l-t  n 

We  shall  show  that  the  equality  holds  in  (7.8).  We  do  this  by  exhib¬ 
iting  a  function  G  which  yields  the  sup.  Suppose  that  1/f  is  an  integer,  or 
nt  —  1.  If  we  take  G(y)  =  y,  we  have 

inf  [(?(*  +  t)  -  G(x)]  =  t  =  ~ 

0<x<l-t  n 

Therefore 

sup  inf  [G(x  +  t)  —  G(rr)]  =  -• 

G  x  n 


Now  suppose  that  1  /t  is  not  an  integer,  and  let  [1/f]  =  n.  Consider 
the  distribution  function 


Then 


G{y)  =  “  S  Ij/(n+i)(y)- 
n  j= i 

G(X  +  t)  —  G(x)  =  -  S  [7j/(n+l)Ct  +  f)  —  Ij/(n+l)(x)]. 
n  j= i 


For  this  function  G(y)  we  have 


Hence,  we  have 


inf  \G{x  +  f)  —  G(x )]  =  — 

0<i<1-(  '  n 


sup  inf  [G(x  +  t)  —  G(x)]  =  - 

G  0  <x  <  i  —  t  n 


1 

[1  /t] 


114  SOLUTION  OF  INFINITE  GAMES 

Therefore 


wmpiffF.ej-i-jjL- 

By  a  similar  argument  we  can  show  that 

sup  inf  E(F ,  (?)  =  1  -  p™ 

F  G  Ll/JJ 

Therefore  the  game  has  a  value  given  by 

v==l~\m 

An  optimal  strategy  for  Blue  is 

F*(x)  =  ~  "s'  Ijt(x). 
n  o 

To  verify  that  F*  is  optimal,  we  consider  two  cases.  First,  suppose  l  —  1/n, 
or  1  /t  is  an  integer ;  then 

E(F*,  G)  =  Jo1_t  [<?(*)  +  1  -G(x  +  <)]  dF*{x) 

=  1  -  /0'“‘  [<?(*  +  0  -  <?(*)]  dF*(x) 

=  1  -  £  "s  /T  [G(*  +  0  -  <?(*)]  dlit{x) 
n  o  -Z0 

=  1  -  £  *2  IGUt  +  t)-  G(jt)] 

Tl  y=0 

=  1  -  -  [(?(1)  -  (?(0)]  =  1  -  -  =  1  -  pt~ 
n  n  [I/O 

Suppose  — <  t  <  -  ;  then  for  all  G, 
n  +  1  n 


E{F*,  (?)  =  JT1  ‘  [(?(*)  +  1  -  G(x  +  0]  dF*(x) 

=  1  [G(jt  +  t)  -  G(jt)] 

n  j=o 

=  1  -  -  [G(nt)  -  0(0)]  >  1  -  -  =  1  -  ~ 
n  n  [l/<] 

In  terms  of  the  original  parameters,  the  solution  of  the  game  is 
as  follows: 


(i)  The  value  of  the  game  =  1 


1 

[T/e] 
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(ii)  Blue’s  optimal  strategy  is  given  by 

(iii)  Red’s  optimal  strategy  is  given  by 

G*(Y)  =  Y,  if  T/e  is  an  integer, 

1  T/e 

G*(Y)  =  S1 1}/(lT/e]+i)(Y)  if  T/e  is  not  an  integer. 


5.  EXAMPLE  OF  GAME  WITHOUT  SOLUTION 


We  shall  now  give  an  example  of  a  game  with  a  discontinuous  payoff 
function  which  does  not  have  a  solution,  nor  a  value. 

Consider  the  infinite  game  in  which  each  player  chooses  a  number  in 
the  closed  interval  [0,  1].  Let  x  and  y  represent  the  choices  by  Blue  and 
Red,  respectively.  Define  the  payoff  to  Blue  as  follows : 


0  for  x  —  y, 

M{x,y)  —  \  —  1  for  x  =  1 ,  y  <  1 ;  and  x  <  y  <  1 ; 

_  +1  for  y  =  1,  x  <  1;  and  y  <  x  <  1. 

The  above  payoff  may  be  represented  as  in  Fig.  15  on  the  unit  square. 


/ 


Let  us  compute  Blue’s  expectation  E(F,  y)  if  Blue  uses  the  mixed 
strategy  F  and  Red  uses  the  pure  strategy  y.  We  have 

(i)  if  0  <  y  <  1, 

E{F,  y)  =  f1  MdF  =  -1  [v~°dF  +  1  f1^  dF  -  1  P  dF 
'  ’  Jx=  0  Jo  Jy+0  J 1-0 

=  -F(y  -  0)  +  F(  1  -  0)  -  F{y)  -  F{  1)  +  F(1  -  0) 

=  -1  +  2F(1  -  0)  -  [F{y  -  0)  +  F(y)]; 
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(ii)  tty  =  0, 

=  2F(1  -  0)  -  F(  1)  =  2F(1  -  0)  -  1; 

(iii)  tty  —  1, 

E(F,y)  =  + 1  Jo1_V  =  F(1  -  0). 

Since  F(y)  >  F(y  —  0),  we  have  from  the  above  that  if  0  <  y  <  1, 

E(F,  y)  <  -1  +  2F(1  -  0)  -  2F(y  -  0) 

=  —  1  +  2[F(1  —  0)  —  F(y  —  0)]. 

Therefore,  for  any  F  and  any  e,  (0  <  e  <  §),  there  exists  a  y0  such  that 

E(F,  y0)  <  -1  +  e. 

In  other  words,  for  any  F,  there  exists  a  G,  namely  G  =  Iya(y)  such  that 

E(F,  G)  <  -1  +  *. 

Therefore 

inf  E(F,  G)  <  -1  +  « 

G 

and  sup  inf  E{F,G)  <  —  1  +  e. 

F  G 

Similarly,  we  can  show  that  for  any  G  and  any  e,  such  that  (0  <  e  <  §) 
there  exists  an  F  such  that 

E(F,  G)  >  +1  - 

Therefore 

inf  sup  E(F,  G)  >  1  —  «. 

G  F 

Thus  we  have  shown  that  for  this  game,  we  have 

inf  sup  E(F,  (?)  >  sup  inf  E(F,  G). 

G  F  F  G 


8  GAMES  WITH  CONVEX 
PAYOFF  FUNCTIONS 


1.  CONVEX  PAYOFF  FUNCTIONS 

For  many  military  and  economic  games  the  continuous  payoff  function 
is  also  convex  in  one  variable.  For  example,  the  outcome  of  many  types 
of  attack-defense  games  is  naturally  described  by  a  convex  payoff  function. 
In  such  cases  it  is  possible  to  describe  the  form  of  the  optimal  strategies. 

A  function  /  is  called  convex  in  the  interval  [0,  1]  if,  for  every  X  for 
which  0  <  X  <  1,  and  for  every  pair  of  strategies  yi,  y2,  we  have 

+  (1  —  X)y2]  <  \f(yi)  +  (1  -  X)/(y2). 

If  the  equality  never  holds  for  X  ^  0,  X  ^  1,  we  call  f  strictly  convex. 
Geometrically,  if  a  function  is  convex  then  between  any  two  points  of  the 
graph  of  the  function,  the  graph  never  lies  above  the  segment  connecting 
the  two  points.  The  function  is  strictly  convex  if  the  graph  of  the  function 
always  lies  below  the  line  segment.  If  a  convex  function  is  not  strictly 
convex,  its  graph  consists  in  part  of  straight  line  segments. 

Suppose  /  is  a  function  of  n  variables ;  then  /  is  convex  if  for  every  pair 
of  distinct  points  (ylt  y2, .  .  . ,  yn)  and  (yi,  y2, .  .  . ,  yn)  on  the  interval, 
we  have 

/[Xyi  +  (1  -  \)yi, .  .  . ,  \yn  +  (1  -  X)y„]  <  \f(yh  ...,yn) 

+  (1  -  *)/(2/l,  •  •  •  ,  yn). 

We  call  /  strictly  convex  if  the  equality  never  holds  for  X  5*  0,  X  ^  1. 
Suppose  /  is  twice  differentiable ;  then  /  is  also  convex  in  y  if 
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The  function  /  is  strictly  convex  if 

>  0  for  all  y. 

dy 2 

Let  us  assume  that  M  (x,  y)  is  strictly  convex  in  y  for  each  x  and  that 
M(x,  y)  is  twice  differentiable  (with  respect  to  y).  Then  for  any  distribu¬ 
tion  function  F{x),  it  follows  that  the  function 

K(y)  =  /0l  M(x,  y)  dF(x) 

d2M 

is  also  strictly  convex.  For,  if  >  0  for  all  y,  then 

?:KM  =  f  d'2M(x  y)  dF^  >  0 

dy 2  J  dy 1 

2.  OPTIMAL  PURE  STRATEGY  FOR  RED 

Let  us  assume  that  M(x,  y)  is  strictly  convex  in  y  for  each  x  and  is 
continuous  in  both  variables.  Suppose  the  game  has  a  solution,  which  will 
be  verified.  Let  F*(x)  be  an  optimal  strategy  for  Blue.  Then  Blue  may 
announce  the  choice  F*(x)  to  Red,  who  will  pick  some  G*  such  that 

j  j  Mips,  y)  dF*{x )  dG*(y)  =  min  f  J  M(x,  y)  dF*(x )  dG(y) 

=  min  j  y)  dF*(x) 

=  min  K*(y). 

V 

Now  K*(y)  is  strictly  convex  in  y.  Hence  K*(y)  assumes  its  minimum  at 
one  point.  Therefore  an  optimal  strategy  for  Red  is  a  pure  strategy,  the 
y  which  minimizes  K*(y). 

3.  VALUE  OF  GAME  IS  min  max  M(x,  /) 

y  * 

Continuing  with  the  assumption  that  M (x,  y)  is  continuous  and  strictly 
convex  in  y,  we  can  obtain  the  value  of  the  game.  1  or  any  game,  its 
value  is  given  by 

v  =  min  max  /  M  (x,  y)  dG(y). 

G  i  J 

We  have  shown  that  Red’s  optimal  strategy  is  a  pure  strategy.  Hence  we 
need  consider  only  those  G’s  which  are  one-step  distribution  functions. 
Therefore 

v  =  min  max  [ 1  M  (x,  y)  dly{y)  =  min  max  M(x,  y). 

y  x  J 0  y  x 
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4.  RED’S  OPTIMAL  PURE  STRATEGY 

An  optimal  strategy  y*  for  Red  must  be  such  that 

v  —  min  max  M{x,  y)  =  max  M(x,  y*). 

y  x  x 

Thus  y*  has  the  property  that  it  minimizes  max  M{x,  y).  From  the  strict 

X 

convexity  of  M(x,  y)  it  follows  that  y*  is  unique. 


5.  BLUE’S  OPTIMAL  STRATEGIES 


Blue’s  optimal  strategy  may  be  pure  or  mixed,  depending  on  the  loca¬ 
tion  of  Red’s  optimal  strategy.  If  Red’s  optimal  strategy  is  an  end  point 
of  the  interval  [0,  1],  i.e.,  if  y*  =  0  or  1,  then  Blue  has  a  pure  strategy. 
If  Red’s  optimal  strategy  y*  is  such  that  0  <  y*  <  1,  then  an  optimal 
strategy  for  Blue  is  to  randomize  over  two  values  of  x. 

Let  Red’s  optimal  strategy  be  an  end  point  of  the  interval  [0,  1],  e.g., 
y*  —  0.  Then 

max  M{x,  0)  =  v. 

X 

Hence 


M(x,  0)  <  v  for  all  x. 

Let  XQ  be  the  set  of  points  x0  such  that 

M(x o,  0)  =  v  for  all  x0  in  Xa. 

Then  the  remaining  set  of  points  Xi  of  the  interval  [0,  1]  have  the  property 
that 


M (xi,  0)  <  v  for  all  Xi  in  X\. 

Now  an  optimal  strategy  for  Blue  consists  of  mixing  some  strategies  Xq 
satisfying  the  condition  M(x o,  0)  =  v.  We  shall  show  that  there  exists  an 
optimal  pure  strategy  for  Blue,  i.e.,  there  exists  some  x0  such  that 
Mix,,  y)  >  v  for  all  y.  This  is  equivalent  to  showing  that  there  exists  some 
x0  e  Xa  such  that 

v  =  min  M(x 0,  y)  =  M(x0,  0) 

y 


or  that  M(x0,  y)  is  a  nondecreasing  function  at  y  =  0. 

Suppose  that  for  every  x0  s  X0,  the  function  M(x0,  y)  is  decreasing 
at  y  —  0,  or 


dM (Xp,  0) 
dy 


—  M'(x o,  0)  <  0. 


Then  for  sufficiently  small  e  >  0,  and  every  x0  in  X0, 


M(x o,  y)  <  v 


for  0  <  y  <  e. 
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Since  M(x h  y)  is  continuous  in  y,  it  follows  that  for  sufficiently  small 
e  >  0,  and  every  X\  in  Xi, 

M  (xh  y)  <v  for  0  <  y  <  e. 

Therefore,  for  every  x  in  [0,  1], 

M  (x,  yi)  <  v  0  <  yi  <  «. 

Hence 

max  M(x,  j/j)  <  v  0  <  yi  <  e. 

X 

This  yields  the  contradiction 

v  =  min  max  M(x,  y)  <  max  M{ x,  yi)  <  v. 

y  x  x 

It  follows  that  there  exists  some  x<>  e  Xo  such  that  the  function  M  (xB,  y) 
is  nondecreasing  at  y  =  0,  or 

M'(x o,  0)  >  0. 

Since  M(x 0,  0)  =  v  and  M  (x0,  y)  is  convex  in  y,  it  follows  that 

v  =  min  M(x0,  y)  or  M(x 0,  y)  >  v  for  all  y. 

V 

We  have  shown  that  if  y*  =  0,  then  Blue  has  an  optimal  pure  strategy 
x*  satisfying  two  conditions: 

M(x*,  0)  =  v,  M'(x*,  0)  >  0. 

In  a  similar  manner  we  can  show  that  if  y*  —  1,  then  Blue  has  an 
optimal  pure  strategy  x*  satisfying  two  conditions: 

M(x*  1)  =  v,  M'(x*  1)  <  0. 

Now  suppose  that  0  <  y*  <  1.  Then 

M(x,y*)<v  for  all  x. 

Let  X0  be  the  set  for  which 

M(x<>,  y*)  =  v  for  all  x0  s  Xo, 

and  let  Xi  be  the  set  for  which 

M( xi,  y*)  <  v  for  all  xi  e  Xx. 

Suppose  that  every  x0  in  X0  were  such  that 

M'(x0,  y*)  <0; 

then  we  would  be  led  to  the  same  contradiction  as  for  the  case  y*  —  0. 
Hence  there  exists  some  Xo  such  that 

M(x*0,  y*)  =  v,  M\x S,  y*)  >  0. 

In  a  similar  manner  we  can  show  that  there  exists  some  £oo  such  that 
M{x So,  y*)  —  v,  M'(xto,  y*)  <  0. 
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Now  consider  the  function 

fit)  =  y*)  +  (1  -  f)Af'(*8 o,  y*). 

We  note  that 

m  =  M’(x So,  »*)  <  0, 

/(l)  =  i/*)  <  0. 

Since  /  is  a  continuous  function  of  t,  there  exists  an  a,  where  0  <  a  <  1, 
such  that 

ccM'ixt,  y*)  +  (1  -  a)M'(x*0 0,  y*)  =  0. 

Having  determined  x%,  xJJ,  and  a  we  shall  now  show  that  an  optimal 
strategy  for  Blue  is 

F*(x)  =  aIX0*(x)  +  (1  -  a)Ixw,{x). 

We  have  that 

K*(y)  =  J  M(x,  y)  dF*(x )  =  aM{x*0,  y)  +  (1  -  a)M{x% 0,  y). 

Since  M(x,  y)  is  convex  in  y,  it  follows  that  K*(y)  is  convex  in  y.  Further 
=  aM'(xo,  y)  +  (1  -  a)M'(xt 0,  y) 

which  vanishes  at  y  =  y*.  Hence  K*{y)  assumes  its  minimum  at  y*,  or 

min  K*(y )  =  aM{x%,  y*)  +  (1  -  a)M{x% <,,  y*)  =  v. 
v 

Therefore  F*{x)  is  an  optimal  strategy  for  Blue. 

WTe  have  shown  that  if  0  <  y*  <  1,  then  an  optimal  strategy  for  Blue 
is  the  step-function 

F*(x)  =  cdxi{x)  +  (1  -  a)Ixl(x) 

where 

M(xi,  y*)  =  M(x2,  y*)  =  v, 

M'(x i,  y*)>  0  >  M'(x2,  y*), 
aM'{x i,  y*)  +  (1  —  a)M'{Xi,  y*)  -  0. 

6.  CONCAVE  PAYOFF  FUNCTIONS 

The  following  dual  result  for  concave  payoff  functions  can  be  proven 
in  the  same  way  as  for  convex  functions: 

Suppose  that  M  ( x ,  y)  is  continuous  in  both  variables  and  is  a  strictly 
concave  function  of  x  for  each  y.  Let  dM  (x,  y)/dx  exist  for  each  x  and  y 
in  the  unit  interval.  The  solution  of  the  game  is  as  follows : 

(i)  v  =  max  min  M{x,  y). 

x  v 

(ii)  Blue  has  a  unique  optimal  pure  strategy  x*. 
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(iii)  (a)  If  x*  =  0,  then  Red  has  an  optimal  pure  strategy  y*  such  that 
M(0,  y*)  =  v  and  dM( 0,  y*)/dx  <  0. 

(b)  If  x*  =  1,  then  Red  has  an  optimal  pure  strategy  y*  such  that 
M{1,  y*)  —  randdM(l,  y*)/dx  >  0. 

(c)  If  0  <  x*  <  1,  then  Red  has  an  optimal  mixed  strategy  which 
is  of  the  form 

G*(y)  =  cJVi{y)  +  (1  -  a)I4y) 
where  the  parameters  a,  yi,  y2  satisfy  the  conditions 
M(x*,  yi)  =  M{x*,  y2)  =  v, 

dM(x*,  Vl)  <Q<  dMjx*,  yi), 
dx  —  ~  dx 

dMjx*,  yi)  ,  q  _  \  dMjx*,  Vi)  =  Q 

“  dx  ~r  ^  1  dx 

7.  GENERAL  CONVEX  PAYOFF 

In  the  previous  discussion  we  assumed  that  the  payoff  wras  strictly 
convex  or  strictly  concave.  Most  of  the  results  are  still  valid  if  the  strict¬ 
ness  assumption  is  removed.  However,  it  is  no  longer  true  that  the  optimal 
strategies  are  unique. 

Although  we  have  assumed  that  each  player’s  strategy  space  is  one¬ 
dimensional,  similar  arguments  would  prove  analogous  results  for 
n-dimensional  strategy  space. 

Example.  Suppose  the  payoff  function  is 

M{x,  y)  =  (x  —  y)2. 

Since 

d2M(x,  y)  =  „ 
d2y 2 

it  follows  that  M(x,  y)  is  convex  in  y  for  each  x.  Hence  the  value  of  the 
game  is 

v  —  min  max  (x  —  y)2  =  min  max  [y2,  (1  —  ?/)2]  =  i- 

y  x  v 

Red’s  optimal  strategy  is  defined  by  that  pure  strategy  which  minimizes 
max  [y2,{  1  -  y)2].  Therefore  y*  =  Blue’s  optimal  strategy  is  a  mix¬ 
ture  of  those  pure  strategies  x.  such  that  M(x,-,  |)  =  v,  or  (x  —  J)2  =  I- 
This  yields  x  =  0  and  1.  To  obtain  the  weights  a  and  1  —  a,  we  solve 
the  equation 

oM'(0, 1)  +  (1  -  a)M'(l,  h)  =  0. 

Substituting,  we  obtain  a  —  5.  Therefore  Blue’s  optimal  strategy  is 
F*(x)  =  |/o(x)  + 
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8.  DEFENSE  OF  TWO  TARGETS  AGAINST  ATTACK 

Suppose  that  Red,  the  Defender,  has  two  targets  T\  and  7\  of  values 
ki  and  k2,  respectively,  which  are  to  be  defended  against  an  attack  by  Blue. 
Let  us  assume  Red  and  Blue  are  equally  strong,  i.e.,  they  have  the  same 
number  of  forces,  S. 

A  strategy  for  Blue  is  an  allocation  of  x  attacking  units  to  Ti,  where 
0  <  x  <  S,  and  the  remainder,  -S’  -  x,  to  TV  A  strategy  for  Red  is  an 
allocation  of  y  to  Th  where  0  <  y  <  S,  and  S  -  y  to  TV 


Target  Ti  Target  T2 
Value  ki  k2 

Offense  allocation  x  S  —  x 

Defense  allocation  y  S  —  y 


Let  the  payoff  to  Blue  be  proportional  to  the  number  of  attacking  units 
that  get  through  to  target  and  the  target  value.  Thus  if  x  >  y,  then 
S  —  x  <  S  —  y,  in  which  case  we  assume  that  x  —  y  units  survive  to  hit 
target  Ti  and  none  survive  to  hit  target  1\.  In  this  case  the  payoff  to  the 
attack  is  kf  x  —  y).  If  x  <  y  then  S  —  x  ~>  S  —  y,  and  y  —  x  units  survive 
to  attack  target  T2  while  none  survive  to  hit  target  7Y  The  payoff  can 
be  summarized  as  follows: 


M(x,  y)  = 


h(x  -  y)  if  x>y 
h  {y  —  x)  if  x  <y. 


One  may  interpret  kx  to  be  the  payoff  per  attacking  unit  that  penetrates 
the  defenses  at  7\. 

It  is  evident  that  M(x,  y)  is  a  convex  function  of  y  for  each  x.  It  consists 
of  two  lines  as  shown  in  Fig.  16.  Therefore 


v  =  min  max  M(x,  y)  =  min  max  [k2y,  kfS  —  w)]. 

V  X  y 


Figure  16 
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Now  the  function 

max  [fay,  fa (S  —  y)) 

assumes  its  minimum  at  that  y  for  which 


faS 


fay  =  fa(S-y)  or  y*  = 


fa 


Therefore  Red’s  optimal  strategy  is  to  allocate  ^  ^ 

Ti  and  the  remainder,  ; — ^T~T  &  defense  units,  to  T2. 

K\  *T“  K  2 

The  value  of  the  game  is 

fafa 


S  defense  units  to 


v  = 


fa  “I-  fa 


S. 


Solving  for  Blue’s  optimal  strategy,  we  set 

fa  S  \ 

M  {*•  fe+rj  “  •• 

This  equation  yields  two  solutions: 

Xi  =  0,  x2  =  S. 


Now 

Hence 

where 


3M( 0,  y*)  dM(S,&  =  _k 

dy  2’  dy 


F*(x)  =  al0(x)  +  (1  -  a)IB(x) 


fa 


ocfa  +  (1  —  a)(  —  fci)  —  0  or  at 


We  summarize  the  solution  of  this  game  as  follows:  The  Defender 
splits  his  forces  and  may  adopt  a  fixed  deployment  of  such  forces  place 
fa/ {fa  +  fa)  of  them  at  T,  and  fa/ (fa  +  fa)  of  them  at  T,.  The  Attacker’s 
optimal  strategy  is  mixed.  The  Attacker  concentrates  his  forces  on  either 
T,  or  Ti  chosen  at  random.  He  chooses  T\  with  probability  fa/ (fa  +  fa) 
and  he  chooses  T2  with  probability  fa/ (fa  +  fa)- 

For  example,  if  target  T 2  is  three  times  as  valuable  as  target  T 1,  or 
fa  =  3fa,  then  T2  is  defended  by  three-fourths  of  the  defensive  force.  The 
optimal  strategy  for  the  Attacker  is  to  strike  7'i  with  all  his  forces  with 
0.75  probability. 


9.  DEFENSE  OF  MANY  TARGETS  OF  DIFFERENT  VALUES 

An  example  of  the  application  of  the  results  on  an  n-dimensional  con¬ 
vex  payoff  function  is  a  simple  air  defense  problem.  Like  most  battle 
situations,  the  combat  between  air  attack  and  air  defense  can  be  viewed  as 
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a  zero-sum  two-person  game:  The  attacker  seeks  the  greatest  possible 
gains  in  the  form  of  the  destruction  of  targets,  and  the  defender  wishes  to 
make  these  gains  as  small  as  possible. 

An  important  decision  of  the  defender  in  a  battle  situation  is  the  dis¬ 
tribution  of  his  total  defense  resources  among  his  targets.  An  important 
decision  of  the  attacker  is  the  distribution  of  his  total  attacking  force 
among  those  targets.  We  shall  consider  this  game  in  a  very  simplified 
form  in  which  we  assume  only  a  single  choice  for  each  player,  namely, 
for  the  attacker  the  choice  of  an  allocation  of  his  resources  among  targets, 
and  for  the  defender  the  choice  of  an  allocation  of  his  resources  among 
those  targets. 

We  wish  to  answer  such  questions  as:  Should  all  the  targets  be  de¬ 
fended?  If  only  some  of  the  targets  are  to  be  defended,  how  shall  these  be 
selected?  How  should  the  attacker  select  his  targets? 

To  answer  these  questions  we  consider  the  following  game  model: 

Defense.  The  defender,  Red,  has  D  units  of  defense  to  distribute  among 
his  n  targets,  which  we  label  7\,  T->,  -  •  •  ,  Tn.  Let  us  assume  that  the  n 
targets  have  values  kh  k2, ... ,  kn,  respectively,  and  are  ordered  as  follows : 

0  <  h  <  k2  <  .  .  .  <  kn. 

Attack.  The  attacker,  Blue,  has  A  units  of  attack  to  distribute  among 
the  n  targets.  Let  us  assume  the  attack  is  at  least  as  strong  as  the  defense, 
or  A  >  D. 

Strategy.  A  strategy  for  Blue  is  an  allocation  of  his  attacking  force  A 
among  the  n  targets.  Thus  a  strategy  for  Blue  is  a  set  of  numbers 
Xi,  x2,  ...  ,xn  such  that 


Xi  >  0  and  2  x<  =  A. 

i=i 

A  strategy  for  Red  is  a  set  of  numbers  y\,  y2, ...  ,yn  such  that 

n 

Vi>  0  and  2  y,  =  D. 

* = i 

Each  y(  represents  the  number  of  defensive  units  allocated  to  target  T,. 

Payoff-  Let  us  assume  that  one  unit  of  defense  can  check  one  unit  of 
attack.  Further,  let  us  assume  that  the  amount  of  damage  to  any  target 
is  proportional  to  the  number  of  attacking  units  which  outnumber  the 
defending  units,  the  coefficient  of  proportionality  depends  on  the  partic¬ 
ular  target.  Finally,  let  us  assume  that  the  payoff  is  the  sum,  over  the 
targets,  of  the  damage  to  each  target.  Thus  the  payoff  to  Blue  is 

n 

M(x,  y)  =  2  ki  max  (0,  x,  -  yf) 

1  =  1 
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where 

n  n 

>  0,  yi  >0,  2  Xi  —  A,  and  2  Vi  =  D. 

t  —  1  i  —  1 

Solution.  It  is  apparent  that  M(x,  y)  is  convex  in  y  for  each  x.  It  is 
also  convex  in  x,  for  each  y.  Therefore  Red,  the  defender,  has  a  pure  strat¬ 
egy  which  is  optimal.  The  attacker  has  a  mixed  strategy  which  is  optimal. 

It  can  be  verified  that  it  is  optimal  for  the  defender  to  distribute  his 
defensive  force  D  among  the  high-valued  targets.  It  can  also  be  verified 
that  it  is  optimal  for  the  attacker  to  select  one  of  the  high-valued  targets 
at  random,  subject  to  a  given  probability  distribution,  and  then  allocate 
his  entire  attacking  force  on  that  target. 

To  give  a  more  precise  description  of  the  optimal  strategies  for  the  two 
players,  we  introduce  the  following  notation : 


m  —  smallest  value  of  s  such  that  ls  >  0. 


s  =  1,  2,  .  .  .  ,n; 
s  =  1,  2,  .  .  ,,n; 


In  terms  of  the  above  definitions,  we  can  verify  the  following: 

The  attacker’s  optimal  mixed  strategy  is: 

(a)  Never  attack  the  low-valued  targets  Ti,  Ti,  ...  ,  Tm- 1. 

(b)  Use  the  entire  attacking  force  A  on  a  target  selected  at  random, 
subject  to  the  following  probability  distribution : 

pr  {Xi  =  A}  —  m  <  i  <  n. 

fCi 


The  defender’s  optimal  pure  strategy  is : 

(a)  Leave  undefended  the  low-valued  targets  Ti,  T-h  .  .  .  ,  Tm-i- 

(b)  Defend  the  high-valued  targets  Tm,  Tm+h  ...  ,Tn  by  placing 

A  —  ^  (n  —  m  +  1  —  m  <  i  <  n, 


units  at  the  ith  target. 

The  value  of  the  game  to  the  attacker  is 

V  =  jd  ( km  ~~  Itn)  ■ 

It  is  of  interest  to  note  that  if  m  <  i  <  n,  then 
hi  max  (0,  A  —  y<)  ==  ktA  ^ 

Ki 

=  KnA  ^7 

=  A.  (fom  lm)  =  V . 


(n  —  m  +  1  —  ^ 

,-m+l-f) 
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Thus  at  each  of  the  defended  targets,  the  attacker  gets  the  value  of  the 
game  if  that  target  is  attacked  by  the  entire  attacking  force. 

If  1  <  i  <  m  —  1,  then 

hi  max  (0,  A  —  yt)  =  hiA  <  A(km  —  lm). 

Thus,  at  each  of  the  undefended  targets,  a  concentrated  attack  yields  less 
than  the  value  of  the  game.  If  the  defender  has  allocated  his  defenses 
optimally,  there  is  no  soft  spot  in  his  targets. 

Example.  Suppose  the  defender  has  five  targets  to  defend  against  an 
attack.  Let  the  target  values  be  the  following: 

fci  =  r^,  hi  =  ks  =  hi  =  if,  hi  —  j. 

Suppose  that  the  attacker  and  the  defender  have  equal  forces,  or 
A  =  D  =  S.  We  have 


]_ 

h 

h 

1 

h2 

h 

1 

h3 

h 


=  12  +  9  +  7  +  5  +  4  =  37, 

i  =  1 

-  *i(5  -1  +  1-1)  =  ^-  ^<0, 

2  ^-  =  9  +  7  +  5  +  4  =  25, 

i=  2  »W 

h3  —  hi{  5  —  2+1  —  1)=-  —  ^<0, 
2  r  =  7  +  5  +  4  =  16, 

%  =3 

hz  -  hz(  5  -  3  +  1-1)  =  |-  ^>0. 


Since  l3  >  0  and  h  <  0,  it  follows  that  m  =  3.  Therefore  the  optimal 
allocations  are  such  that  the  first  two  targets  are  not  defended,  nor  are 
they  attacked.  Targets  3,  4,  and  5  are  defended  as  follows: 


4-f+>Hs 

44<2>}-f 


defense  units  at  target  T3, 
defense  units  at  target  Tit 
defense  units  at  target  T6. 


The  optimal  strategy  for  the  attacker  is  to  select  one  of  the  three  targets 
T3,  1\,  or  rI\  at  random  and  concentrate  his  attack  on  that  target.  The 
probabilities  associated  with  these  targets  are  x7g,  As,  respectively. 

The  value  of  the  game  is 


GAMES  OF 
TIMING— DUELS 


1.  DUEL  AS  A  GAME  OF  TIMING 

The  theory  of  games  may  be  used  to  analyze  a  class  of  problems  dealing 
with  the  timing  of  decisions  in  a  competitive  environment.  In  these  prob¬ 
lems,  the  actions  which  the  players  may  take  are  given  in  advance,  but  the 
timing  of  the  actions  is  by  the  strategic  decisions  of  the  players.  Such  games 
are  characterized  by  the  following  conflict  of  interests :  each  player  wishes 
to  delay  his  decision  as  long  as  possible,  but  he  may  be  penalized  for  waiting. 
In  a  duel,  for  example,  each  duelist  wishes  to  hold  his  fire  as  long  as  possible, 
since  his  accuracy  increases  with  time.  However,  if  the  duelist  holds  his 
fire  too  long,  his  opponent  may  win  the  duel. 

Since  the  duel  is  a  good  example  of  a  game  of  timing,  we  shall  use  the  duel 
with  bullets  as  our  model  of  games  of  timing.  Thus,  we  shall  consider  an 
action  to  be  the  firing  of  a  bullet.  The  result  of  the  action  is  given  by  an 
accuracy  function  representing  the  probability  of  hitting  the  opponent  as 
a  function  of  time  of  firing.  As  in  all  games,  we  need  to  describe  the  infor¬ 
mation  available  to  the  players.  If  a  duelist  is  informed  about  his  oppo¬ 
nent’s  actions  as  soon  as  they  take  place,  we  shall  call  the  duel  a  noisy  duel. 
If  neither  duelist  ever  learns  when  or  whether  his  opponent  has  fired,  we 
shall  call  the  duel  a  silent  duel. 

2.  NOISY  DUEL:  ONE  BULLET  EACH  DUELIST 

Let  us  first  consider  the  noisy  duel  in  which  each  duelist  has  one  bullet. 
Each  duelist  is  informed  of  his  opponent’s  action,  firing  his  bullet,  as  soon 
as  it  takes  place.  Further,  let  us  assume  that  if  a  duelist  fires  and  misses, 
the  other  duelist  can  obtain  a  sure  hit  by  waiting  until  they  are  together. 
The  duelists,  starting  at  a  distance  D  apart,  approach  each  other  with  no 
opportunity  for  retreat.  The  accuracies  increase  steadily  as  the  duelists 
approach  each  other  and  ultimately  are  certainty,  or  1,  when  the  duelists 
are  breast  to  breast. 


128 


GAMES  OF  TIMING — DUELS  129 

A  strategy  for  Blue  instructs  him  when  to  fire  his  bullet  if  his  opponent 
has  not  already  fired;  and  if  the  opponent  has  already  fired  and  missed, 
then  Blue  fires  when  his  accuracy  is  1.  Thus  a  strategy  for  Blue  is  to  fire 
his  bullet  when  the  duelists  are  x  units  apart,  where  0  <  x  <  D.  Similarly, 
a  strategy  for  Red  is  to  fire  when  the  duelists  are  y  units  apart,  where 
0  <  y  <  D.  Let  the  accuracies  of  Blue  and  Red  be  Pi(x')  and  Pfy),  re¬ 
spectively.  That  is,  Pi{x)  is  the  probability  of  Blue’s  hitting  his  opponent 
if  he  fires  when  the  duelists  are  x  units  apart.  Assume  the  accuracies 
increase  as  the  distance  decreases. 

Let  the  payoff  be  +1  to  the  surviving  duelist  and  0  to  each  duelist  if 
both  survive  or  neither  survives.  The  payoff  M  (x,  y),  to  Blue,  is  his  ex¬ 
pectation  of  survival  for  his  three  possible  ranges  of  firing  times:  firing  be¬ 
fore  Red  fires,  firing  at  the  same  time  as  Red  fires,  and  firing  after  Red  fires. 
Thus  the  payoff  is  given  by 

'  Pi(x)(l)  +  [1  -  Pi(x)](-1)  =  2  Px(x)  -  1  if  x>y 

M(x  v)  =  J  Pl(*)[1  ~  ~  A(*)](-l) 

’  =  Pi(x)  —  P2(x)  if  x  =  y 

.  PtivX-l)  +  [1  -  P*fo)](l)  =  1  -  2 P2(y)  if  y  >  x. 

Since  Pi(x)  and  Pi(y)  increase  with  decreasing  values  of  x  and  y, 
respectively,  it  follows  that 

max  min  M(x,  y)  =  max  min  [2/\(x)  —  1,  I\{x)  —  P2(x),  1  —  2P-i(x)]. 

x  y  x 

Now  divide  the  interval  [0,  D]  into  three  intervals  as  follows: 

Consists  of  those 
Interval  x  for  which 

A  Pi(x)  +  P2(x )  >  1 

B  PAx)  +  P2(x)  =  1 

C  Pi(x)  +  P2(x)  <  1 

These  intervals  are  not  vacuous. 

Let 

y(x)  =  min  [2Pfx)  -  1,  P,(x)  -  P2(x),  1  -  2P2(x)]. 

Then 

max  min  M(x,  y)  =  max  y{x)  —  max  [max  n(x),  max  n(x),  max  y(x)]. 

x  y  x  i  xeA  xzB  xzC 

Now  for  all  a;  in  A,  we  have 

Pi(x)  +  P2(x)  >  1, 

from  which  it  follows  that 

1  -  2 Pt(x)  <  Pi(x)  -  P2(x)  <  2Pi(x)  -  1. 

Therefore  if  xsl,  then 


m(x)  =  1  -  2 P2(x). 
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In  interval  B,  which  is  a  point,  we  have 

Px{x)  -f  P2(x)  ■=  1. 

It  follows  that 

1  -  2 P2(x)  =  I\(x)  -  P2(x)  =  2Pi(x)  -  1. 

Therefore  if  x  e  B,  then 

n{x)  =  Pi(x)  -  P2(x). 

Interval  C  is  defined  by  those  x  for  which 
ft(m)  +  P2(x)  <  1. 

It  follows  that 

2Py(x)  -  1  <  Px{x)  -  P2(x)  <  1  -  2P2(x). 

Therefore,  for  all  x  in  C, 

n(x)  =  2Pi(x)  -  1. 

Let  x *  be  defined  by 

Pi(x*)  +  P2(x*)  -  1. 

It  follows  that 

max  /i(i)  =  1  —  2P2(x*), 

xtA 

max  jla(x)  =  P\(x*)  —  Pi(x*), 

xzB 

max  n(x)  =  2Pi(x*)  —  1. 

xtC 

Therefore,  we  have 

max  min  M (x,  y)  =  Pi(x*)  —  P2(x*) 

x  y 

where  x*  satisfies  the  equation 

Pi(x*)  +  P2(x*)  -  1. 

In  a  similar  manner  we  can  show  that 

min  max  M(x,  y)  =  Pi{y*)  —  Pi{y*) 

y  x 

where  y*  satisfies  the  equation 

Pi(j/*)  +  P.(0*)  =  I- 

We  have  thus  shown  that  M(x,  y)  has  a  saddle-point  at  x*,  y*.  The 
optimal  strategy  for  each  player  is  to  fire  when  he  is  at  a  distance  l  from 
his  opponent  given  by 

Pi©  +  ft©  -  1. 

The  value  of  the  game  is  Pi(l)  —  P2(f). 

Summary  of  solution.  The  optimal  strategy  for  the  duelists  is  to  fire 
their  bullets  simultaneously  at  a  distance  x0  which  satisfies  the  equation 

Pi(xo)  +  P2(x  0)  =  1 . 
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If  Blue  uses  this  strategy,  then  he  is  sure  of  receiving  at  least 
Pi(x0)  —  P2(x o).  If  Red  uses  this  strategy,  he  will  lose  at  most 
Pi{x0)  —  P2(xo). 

Example.  Suppose  that  Blue’s  accuracy  is  given  by  Pi(x)  =  1  —  x  and 
Red’s  accuracy  is  given  by  P2{y)  =  1  —  y2.  Then  each  duelist  should  fire 
his  bullet  at  distance  x  determined  by 

x  +  x2  =  1 

or  x  =  0.62.  The  value  of  this  duel  is  x1  —  x  =  —0.24  to  Blue  and  +0.24 
to  Red. 

If  the  two  duelists  have  the  same  accuracies,  then  they  should  fire  when 
their  accuracies  are  0.5.  The  value  of  this  duel  game  is  zero. 

3.  NOISY  DUEL:  ONE  BULLET  EACH  DUELIST,  WITHOUT  SADDLE-POINT 

In  the  previous  section  we  discussed  a  noisy  duel  which  had  a  saddle- 
point  for  its  solution,  i.e.,  each  player  had  an  optimal  pure  strategy.  The 
existence  of  the  saddle-point  depends  largely  on  the  assumption  that  the 
duelists  have  equal  worths.  In  particular  we  assumed  a  payoff  of  one  unit 
to  the  surviving  duelist,  whether  he  be  Blue  or  Red.  If  the  payoff  to  the 
surviving  duelist  depends  on  which  duelist  survives,  then  the  resulting 
duel  does  not  have  a  saddle-point.  A  duel  between  a  bomber  and  a  fighter, 
where  the  bomber  is  worth  more  than  the  fighter,  is  an  example  of  a  duel 
with  unequal  worths. 

Let  us  assume,  as  before,  that  the  two  duelists  approach  each  other. 
Their  accuracies  are  P\{x)  and  P2(t/),  respectively.  We  assume  that  the 
duel  is  noisy,  i.e.,  if  a  duelist  misses,  his  opponent  is  certain  of  a  hit.  Let 
us  assume  the  following  payoff  to  Blue: 

a,  if  Blue  alone  survives, 

/3,  if  Red  alone  survives, 

7,  if  neither  Blue  nor  Red  survives, 

0,  if  both  Blue  and  Red  survive. 

It  is  reasonable  to  assume  that  a  >  /3. 

Let  a  strategy  for  each  duelist  be  a  time  to  fire  if  his  opponent  has  not 
fired,  and  to  fire  at  the  time  when  his  accuracy  is  1  if  his  opponent  has  fired 
and  missed.  If  x  and  y  are  strategies  of  Blue  and  Red,  respectively,  then 
the  payoff  to  Blue  is: 

f  (a  -  0)Px{x)  +  0  if  X  <y, 

M{x,  y)  =  \  aPi(x)  +  +  (y  -  0  -  a)Pi(x)P2(x)  if  x  =  y, 

l  a  -  (a  -  0 )P2(y)  if  x  >  y. 

In  order  to  determine  whether  the  game  has  a  saddle-point,  it  is  nec¬ 
essary  to  evaluate  max  min  M(x,  y )  and  min  max  M(x,  y).  From  the  fact 

x  y  y  x 
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that  P i(x)  and  Pi(y)  are  monotonic  increasing  functions,  it  follows  that 

max  min  M(x,  y)  —  max  min  [(a  —  p)Pi(x)  +  p,  al’i(x)  +  fiP-iix) 

+  (T  -  P  ~  a)Pi(x)P*(x),  a  -  (a  -  /3)P2(x)], 

min  max  M(x,  y)  =  min  max  [(a  —  p)P\(y)  +  p,  aPx(y)  +  pPz{y) 

+  (7  -  P  ~  ct)Pi{y)P‘^y),  a  -  (a  -  P)Pz(y)]. 

The  functions  (a  —  jS)Pi(x)  +  3  and  a  —  (a  —  /3 )Pz{x)  are  monotonic 
increasing  and  monotonic  decreasing,  respectively,  having  a  common  value 

Zo  =  (a  —  P)Pi(xe)  +  p  =  a  —  (a  —  P)P2(x0) 

for  some  x0  for  which  Pi  (ar0)  +  Pt(x 0)  =  1.  Suppose  that  (7  —  P  —  a)  >  0. 
Then 

aPi(x 0)  +  pPi{x 0)  +  (7  —  P  —  a)Pi(x0)P 2(^0)  >  z0. 

Therefore, 

max  min  M  (x,  y)  —  Za  and  min  max  M  (x,  y)  >  z0. 

x  y  y  x 

Thus,  if  (7  —  p  —  a)  >  0,  the  game  does  not  have  a  saddle-point,  and 
therefore  no  pure  strategy  solution  for  both  sides. 

If  (7  —  P  —  a)  <0,  then  it  follows  that 

min  max  M(x,  y)  —  Zo,  max  min  M  (x,  y)  <  z0, 

y  x  x  y 

and  the  game  again  does  not  have  pure  strategy  solutions. 

The  graph  in  Fig.  17  illustrates  that  M(x,  y)  does  not  have  a  saddle- 
point.  We  assume  that  7  —  p  —  a  >  0.  The  point  P  is  defined  by 


Figure  17 
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Pi(x)  +  I\(x)  —  1  and  gives  the  max  min  M(x,  y).  The  point  Q  which 

x  y 

cannot  coincide  with  point  P  yields  the  min  max  M{x,  y ). 

v  X 

In  the  special  case  where  y  —  0  —  a  =  0,  the  game  does  have  a  saddle- 
point  at  x0  satisfying  Pi(x0)  +  P2{x0)  =  1.  This  case  was  considered  in  the 
last  section. 

We  summarize,  without  proof,  the  results  of  this  type  of  duel.  If  the 
worths  are  such  that  (y  —  0  —  a)  >  0,  then  the  game  has  a  value  which 
is  given  by 

v  —  (a  —  0)P  i(x0)  +  0, 

where  Xo  is  such  that 

Pi(xo)  +  P2(x  0)  =  1. 

Blue  has  an  optimal  pure  strategy 

F*(x)  =  In(x). 

However,  Red  has  no  optimal  strategy.  He  wishes  to  play  a  pure 
strategy  as  close  as  possible  to  x0  but  not  equal  to  x0. 

If  the  worths  are  such  that  (y  —  0  —  a)  <0,  then  Red  has  an  optimal 
pure  strategy  and  Blue  has  no  optimal  strategy. 


4.  NOISY  DUEL:  MANY  BULLETS,  EQUAL  ACCURACIES 

If  the  duelists  have  more  than  one  bullet  each,  and  if  we  assume  that 
the  worths  of  the  duelists  are  the  same,  namely,  + 1  to  the  surviving  duelist 
and  0  otherwise,  then  it  is  relatively  easy  to  compute  the  optimal  strategy. 
Suppose  that  at  time  t,  Blue  and  Red  have  mit)  and  n(t)  bullets,  respec¬ 
tively,  and  equal  accuracy  functions,  p(t).  An  optimal  strategy  for  either 
player  is  to  fire  one  bullet  whenever 

m(t)  +  n(t) 

However,  the  duelist  with  fewer  bullets,  at  any  such  time,  should  not 
fire  until 

^  m{l)  +  n(t) 

That  is,  he  should  hold  his  fire  momentarily  and  then  shoot  only  if  his 
opponent  did  not  shoot.  The  value  of  the  game  is 

_  m( 0)  —  n( 0) 

V  m(  0)  +  n(0) 

For  example,  if  Blue  has  two  bullets  and  Red  has  three  bullets  and 
both  players  use  their  optimal  strategies,  then  Red  fires  when  the  accuracy 
p(<)  =  both  fire  at  p(t)  —  J,  and  both  fire  at  p{t)  =  |,  and  v  =  —  |  to 
Blue.  In  particular,  if  pit)  =  l,  then  Red  fires  at  t  =  both  fire  at  t  —  |, 
and  both  fire  at  t  —  |. 
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5.  NOISY  DUEL:  ONE  BULLET,  ARBITRARY  ACCURACIES 

Up  to  this  point  we  have  assumed  that  the  accuracies  of  the  duelists 
increase  with  time,  and  approach  certainty.  We  now  drop  this  restriction. 
Assume  that  the  accuracies  pi(i),  p2(f)  at  time  t  >  0  are  continuous  func¬ 
tions,  not  necessarily  monotonic,  and  that  pi(f)  =  p2(f)  =  C  for  all  t  larger 
than  some  4.  We  summarize  without  proof  the  optimal  strategies  of  the 
duelists.  If  Blue  fires  at  4  and  misses,  Red  fires  at  4  for  which  p2(<)  is  a 
maximum  in  the  interval  4,  4-  For  a  pair  of  strategies  (4,  4),  the  payoff 
to  Blue  is 

f(T)  =  pi (D  -  [1  -  Pl(T)]mt(T)  if  4  <  4, 

g(T)  =  -p2(T)  +  [1  -  p*(T)]»ni(T)  if  4  <  4, 

h(T)  =  pi(D  -  p*(T)  if  4  =  4, 

where  T  —  min  (4,  f2),  m(T)  =  max  p(f).  Define 

t>T 

Fit)  =  ma  x/(r),  and  G{t)  —  ming'(r); 

T  <t  T  <£ 

then 

(i)  If  F(t)  and  G(t)  intersect  first  at  T0  >  0,  then  v  =  F(T0)  =  G(T0). 

There  exist  4  <  T0  and  4  <  T0  such  that  /(4)  =  (j(M)  —  v  and 

max  (4,  4)  =  T0.  Approximately  optimal  strategies  for  Blue  and 
Red  are  obtained  by  choosing  random  times  near  4  and  4,  respec¬ 
tively,  thus  eliminating  any  effect  of  the  function  h(T). 

(ii)  If  F( 0)  >  G{ 0),  then  v  =  med  {/ (0) ,  (?(0),  h( 0)}.  If  e  is  a  small 
positive  number  chosen  at  random,  the  solution  (4,  4)  is  (0,  e), 
(e,  0),  or  (0,  0),  depending  on  whether  the  median  is  assumed  at 
/( 0),  g( 0),  or  h( 0),  respectively. 

Example.  Suppose  4  and  the  accuracies  are 

to  =  t,  C  =  pi(t)  =  ||  -  f|,  p2(<)  =  I  -  ||  -  <|. 

This  is  case  1,  and  v  =  0,  4  =  0,  4  =  f  .  If  the  time  t  =  f  has  arrived  and 

Blue  has  erred  by  not  firing,  Red  should  re-evaluate  on  the  basis  of  the 
new  game  beginning  at  t  =  f .  This  duel  is  Case  2  and  its  value  is  —  |,  4 
=  |,  4  =  0.54. 

6.  SILENT  DUEL:  ONE  BULLET  EACH  DUELIST,  EQUAL  ACCURACIES 

In  a  silent  duel  each  duelist  knows  how  many  bullets  the  duelists  have 
at  the  start  of  the  duel,  but  each  duelist  is  ignorant  of  any  firing  by  the 
other.  Let  us  assume  that  each  duelist  has  one  bullet  at  the  beginning  of 
the  duel.  The  duelists  approach  each  other  with  no  opportunity  for  retreat. 
Let  us  assume  that  the  two  duelists  have  the  same  accuracy  functions; 
i.e.,  if  the  time  of  firing  is  the  same,  the  duelists  have  the  same  probability 
of  a  kill. 

Since  the  accuracies  increase  with  time,  each  duelist  wishes  to  postpone 
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firing  as  long  as  possible.  However,  by  postponing  his  firing  he  also  in¬ 
creases  the  probability  of  being  hit  by  his  opponent.  Further,  the  duel 
being  silent,  he  does  not  know  whether  his  opponent  has  fired.  This  con¬ 
flict  of  interests  may  be  resolved  within  the  framework  of  game  theory. 

Let  us  assume  that  the  worths  of  the  duelists  are  the  same,  and  assign 
to  Blue  the  following  values  to  the  possible  outcome : 

+  1,  if  Blue  only  survives, 

—  1,  if  Red  only  survives, 

0,  if  both  survive  or  neither  survives. 

Suppose  that  Blue  fires  when  his  accuracy  is  x  and  that  Red  fires  when 
his  accuracy  is  y,  where  0<a;<l,0<y<l;  then  the  payoff  to  Blue  is 

f  x  +  (1  —  x)y(-l)  =  -y  +  (1  +  y)x  if  a;  <  y, 

(9.1)  M(x,  y)  =  j  x(l  —  x)  +  x{\  —  x)(—  1)  =0  ii  x  =  y, 

L  2/(— 1)  +  (1  -  y)x  =  -y  +  (i  -  y)x  if  y  <  x. 

We  find  that 

max  min  M(x,  y)  —  2V2  —  3, 

x  y 

min  max  M(x,  y)  =  3  —  2  V 2. 

V  x 

Therefore  the  game  does  not  have  a  saddle-point.  Suppose  that  Blue  uses  a 
mixed  strategy  F  and  Red  uses  a  pure  strategy  y;  then  Blue’s  expectation  is 

E(F,  y)  =  £  M(x,  y)  dF(x)  =  J’~J  °  M(x,y)  dF(x) 

+  fv\0M(x,  y)dF{x) 

=  r  l~y  +  (1  +  V)x]  dF(x) 

+  f't-y  +  O--  »)*]  dF&) 

Jy+ o 

(9.2)  =  -yF(y  -  0)  +  (1  +  y)  jj  °  x  dF(x )  -  y  +  yF(y  +  0) 

+  (!“»)  fy+ o  x  dFi-x) 

=  y[F(y)  -  F{y  -  0)]  -y  +  (1  +  y)  fj'  °  x  dF(x) 

+  (!  -  y) i;+0  x  dF^)- 

Suppose  that  the  mixed  strategy  F  is  a  density  function  over  the  interval 
(a,  1),  or,  i.e.,  dF(x)  =  P(x)  dx  if  a  <  x  <  1  and  dF(x)  =  0  if  x  <  a.  Then 

-y  +  (1  +  y )  Jj  xP{x)  dx 

+  (1  —  y)  f1  xP(x)  dx  if  y  >  a 
Jy 

-y  +  (i  -  v )  £  xP(x)  dx  if  y  <  a. 


(9.3) 


E(F,  y)  = 
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Since  the  game  is  symmetric,  the  value  of  the  game  is  zero.  It  follows 
that,  if  the  game  has  a  density  function  P  for  a  solution,  then  for  all  y  for 
which  P(y)  ^  0,  E(F,  y)  =  v  =  0.  Hence,  if  the  game  has  a  density 
function  for  a  solution,  we  must  have 


(9.4)  —  y  +  (1  +  y)  fv  xP(x)  dx  +  (1  —  y)  f1  xP(x)  dx  =  0, 

J«  Jv 

independent  of  y. 

Differentiating  the  preceding  expression  twice  with  respect  to  y, 
we  obtain 

3  P(y)  +  yP'{y)  =  0. 

Now,  solving  this  differential  equation,  we  get 

(9.5)  P(y)  = 

By  substituting  (9.5)  into  (9.4),  we  have 


-y+(l+y)C  /;!  +(l-y)C  /;f  =  0. 

Hence 

-y  + C(1  +  »>(«  “  i) +  C(1  “  S)G  -  ■)  -  °. 

independent  of  y  for  a  <  y  <  1.  This  yields 

«  =  i  C  =  \. 

Now  we  shall  show  that  an  optimal  strategy  for  Blue  and  Red  is  to 
fire  when  the  accuracy  is  x  with  the  density  function  P(x)  =  l/4x3  where 
f  <  3  <  1  and  not  to  fire  before  x  =  \.  Suppose  Blue  uses  this  strategy; 
then  for  J  <  y  <  1,  we  have 


E(F*,  y)  =  —y  + 

For  y  <  we  have 

£(/*,  y )  = 


(l  +  y) 

4 


r 

hr 


dx  (1  —  ;/)  f  i  dx 
1/3  x 2  4 


rd- f-o. 

Jv  X 1 


-y  + 


(i  -  v ) 


/, 


i  dx 

1/3  x 2 


=  _22/  +  |-°- 

Therefore  the  optimal  strategy  for  Blue  and  Red  is  the  following 
mixed  strategy: 

F*(x)  =0  if  0  <  x  < 

o 

«!<*<!■ 
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7.  SILENT-NOISY  DUEL:  ONE  BULLET  EACH  DUELIST 


Mixing  the  duel,  i.e.,  letting  one  duelist  be  informed  of  his  opponent’s 
firing,  if  any,  yields  optimal  strategies  which  are  different  from  the  noisy 
or  silent  duels.  Suppose  that  Blue  is  the  silent  duelist  and  that  Red  is  the 
noisy  duelist.  Assume  that  the  two  duelists  have  the  same  accuracies. 
Then,  if  x  and  y  are  the  accuracies  at  the  time  of  firing  for  Blue  and  Red, 


respectively,  the  payoff  to  Blue  is 

f  x  -  y  +  xy 


Mix,  y)  =  7 


1  -  2  y 


for  x  <  y, 
for  x  >  y, 


0 


for  x  =  y. 


Let  a  —  V6  —  2;  then  it  can  be  verified  that  the  value  of  the  game  is 


v  =  1  -  2a  =  0.101. 


It  can  be  verified  that  Blue,  who  has  a  silent  bullet,  has  a  unique  optimal 
strategy,  namely,  the  density  function 


ro 


/(*)  = 


i _ a/2  a 

[  (x>  +  2x-  1)3/2 


for  0  <  x  <  o, 
for  a  <  x  <  1. 


The  optimal  strategy  for  Red  is  also  unique  and  is  the  same  density 
f(y)  mixed  with  the  pure  strategy  y  =  1  in  the  ratio  2/a.  In  terms  of  a 
cumulative  distribution  function,  G(y),  the  optimal  strategy  for  Red  is 


G(y)  =  2+~a  !»  dy  +  2+~ahiy)- 


8.  SILENT  DUEL:  ONE  BULLET  VERSUS  TWO,  EQUAL  ACCURACIES 

Giving  one  of  the  duelists  an  additional  bullet  considerably  complicates 
the  solution  of  the  duel,  even  for  the  simple  case  where  all  accuracies  are 
equal  and  monotonically  increasing  from  zero  to  one.  For  safest  results, 
the  duelist  with  two  bullets  should  shoot  his  two  bullets  in  separate  inter¬ 
vals  but  with  a  positive  probability  of  saving  the  second  bullet  until  the 
end.  The  duelist  with  one  bullet  defends  himself  against  each  of  his  op¬ 
ponent’s  bullets  with  two  different  density  laws,  spending  slightly  more 
than  half  of  his  fire-probability  on  the  first. 

Let  the  duelist  with  one  bullet  be  Blue,  who  chooses  a  firing  time  x, 
with  0  <  x  <  1.  Red,  who  has  two.  bullets,  chooses  two  firing  times,  y  and 
z,  with  0  <  y  <  z  <  1,  where  the  time  is  identified  with  accuracy.  The 
payoff  to  Blue  is 
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x  —  (1  —  x)y  —  (1  —  x)(l  —  y)z  for  x  <  y  <  z, 

M{x,  y,  z)  =  —  y  +  (1  —  y)x  -  (1  -  y)(l  -  x)z  for  y  <  x  <  z, 

—y  —  (1  —  y)z  +  (1  —  2/)(l  —  z)x  for  y  <  z  <  x. 

The  solution  of  this  duel  is  long  and  involved.  We  shall  summarize, 
without  proof,  the  optimal  strategies  in  this  duel.  First,  the  game  has  a 
value.  The  duelist  with  one  bullet,  Blue,  is  at  a  disadvantage  and  the 
value  of  the  duel  to  him  is 

v  f  +  f?  where  a  =  =  1.25593, 

=  -0.30650. 

Blue  also  has  a  unique  optimal  mixed  strategy,  which  is  described  by  the 
density  function 

0  if  0  <  a;  <  a, 

■  if  a  <  x  <  b, 

if  b  <  x  <  1, 
x 3 

where  the  constants  are  defined  by 

k  =  rfo"  \  =  0.13805,  l  =  - i  =  0.25760, 

1  +  2a  —  b  ’  1+2  a  —  b 

a  =  -~y  ■  =  0.28475,  b  =  - =  0.46410. 

1  +  2a  ’  1  +  2  Vf 

If  Blue  uses  this  strategy,  Red  is  forced  to  fire  his  first  bullet  between 
a  and  b,  and  his  second  bullet  after  b,  if  he  is  to  ensure  himself  the  value 
of  the  game.  Red  should  fire  his  first  bullet  with  probabilities  described 
by  the  density  function 

0 


0 
o 

where  m  =  —  —  =  0.26006. 

4  +  6a 

The  second  bullet  should  be  fired  with  a  combination  of  a  density  function 
and  a  single  step,  as  follows ; 


if  0  <  y  <  a, 
ii  a  <  y  <  b, 
if  b  <  y  <  1, 
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where 


f  0 


l  y 


if  0  <  z  <  b, 
b  <  z  <  1 
2=1. 


7  =  2  -  v"3  =  0.26795, 


=  0.40192. 


The  randomizations  on  y  and  z  are  carried  out  independently. 

Figure  18  shows  graphically  the  optimal  strategies  of  the  two  duelists. 
The  duelist  with  two  bullets  has  a  positive  probability  of  saving  his  second 
bullet  until  the  end.  This  is  shown  by  the  area  of  the  wedge  at  y  —  1.0. 
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9.  SILENT  DUEL:  POSITIVE  INITIAL  ACCURACY 

Suppose  that  the  two  duelists  have  one  silent  bullet  apiece  and  that 
their  accuracies  are  equal  and  increase  monotonically,  but  from  an  initial 
value  a  >  0  to  1.  This  situation  may  arise  if  neither  duelist  is  permitted 
to  fire  before  his  accuracy  reaches  the  value  a.  The  introduction  of  this 
constraint  modifies  the  solution  of  the  silent  duel.  We  describe,  without 
proof,  the  optimal  behavior  of  the  duelists. 

Let  f(x)  =  Jz3  be  the  probability  density  function,  which  is  also  the 
solution  of  the  silent  duel  without  constraints.  The  optimal  behavior  of  the 
duelists  will  depend  on  the  size  of  a,  as  follows: 

(i)  If  0  <  a  <  then  fire  according  to  the  probability  density  /(z) 
in  the  interval  §  <  x  <  1  and  never  fire  before  x  =  \. 

(ii)  If  I  <  o  <  h  let  (|,  b )  be  the  interval  in  which  f(x)  has  center 
of  mass  at  a.  Fire  after  b  according  to  f(x),  and  concentrate  the 
remaining  firing  probability  at  a. 

(iii)  If  I  <  a  <  1,  fire  always  at  a. 

To  illustrate  the  solution,  suppose  that  a  —  f,  which  falls  in  Case  2. 
To  determine  b,  we  need  to  solve  the  equation 

L  xf(x)  dx  _  4 

Iw-JW dx  9 

where  f(x)  =  Jo;3.  This  yields  b  =  § .  Therefore  the  optimal  strategy  for 
the  duelists  is  to  fire  according  to  the  probability  density  j{x)  =  \xz  from 
x  -  f  to  x  —  1.  This  uses  up  of  the  firing  probability.  The  remaining 
firing  probability,  is  concentrated  at  a  =  f . 

It  will  be  observed  that  the  average  firing  accuracy  in  each  of  the  three 
cases  above  is 

max  (^,  a). 

This  is  exactly  the  optimal  firing  time  for  the  associated  noisy  duel,  where 
the  duelists  have  equal  accuracy  functions  and  may  fire  after  the  accuracy 
exceeds  a. 

10.  SILENT  DUEL:  m  BULLETS  EACH  DUELIST 

Assume  equal  monotonic  accuracy  functions  with  each  duelist  having 
m  bullets.  Assume  that  the  duel  is  silent.  Then  the  optimal  strategy  is  to 
fire  the  (m  +  1  —  fc)th  bullet  in  the  interval 
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according  to  the  inverse  cubic  law  \kxs.  The  value  of  the  game  is  zero. 

It  is  interesting  to  note  that  the  solution  of  the  noisy  duel  with  many 
bullets  requires  the  shooting  of  the  (m  +  k  —  l)th  bullet  at  the  harmonic 
mean,  of  the  same  interval. 


11.  SILENT  DUEL:  STRICTLY  MONOTONIC  ACCURACIES 


We  now  drop  the  restriction  of  equal  monotonic  accuracy  functions. 
We  consider  the  silent  duel  in  which  the  accuracies  are  arbitrary  but 
strictly  monotonic.  Each  duelist  has  only  one  bullet. 

Let  pi (<)  and  p2(f)  be  the  probabilities  of  a  hit  by  Blue  and  Red, 
respectively.  Let  pi(0)  =  p2(0)  =  0  and  pi(l)  =  1-  Define 


m  = 


V2  (t) 

pi(0p!(0’ 


g(t)  = 


vM. 


Let  ai  and  a2  be  defined  by  the  following  equations: 


f'J — i, 

Jaz  Z 


and  let 


a  =  max  (oi,  02)- 


We  now  summarize  the  optimal  strategy  in  terms  of  these  defined 
quantities.  The  optimal  strategy  for  Blue,  whose  accuracy  function  is  pi(t), 
is  to  fire  his  bullet  according  to  the  following  cumulative  distribution 


function: 


0  for  0  <  t  <  a, 


for  a  <  t  <  1, 
for  t  =  1. 


The  optimal  strategy  for  Red,  whose  accuracy  function  is  p2(<)i  is  given 
by  a  similar  cumulative  distribution  function: 


f  0  for  0  <  t  <  a, 


G(t)  = 


u 


for  a  <  t  <  1, 
for  t  =  1. 


In  other  words,  Blue  has  a  jump  at  1  if  ai  <  a. 
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The  value  of  the  game  is 


'  1  -  3 rh(t) 

1  +  Pz{a) 

—  1  +  3pi(a) 
1  +  pi(a) 


if  a  =  oi, 
if  a  =  a2. 


Example.  Let  Blue  have  an  accuracy  function  p\(x)  =  x  and  Red  have 
an  arbitrary  accuracy  function  pi(x)  =  p(x).  Let  a,  @  be  the  probabilities 
that  Blue  and  Red,  respectively,  fire  at  the  time  when  their  accuracies 
are  1.  Table  3  summarizes  the  solution  for  various  forms  of  p{x). 


Table  3.  Silent  Duel,  Monotonic  Accuracies 


p(x) 

a 

t 

a 

P 

X 

0.333 

0 

0 

0 

2x2 

1  +x2 

0.409 

0.1021 

0 

0.0764 

x(3  —  x) 

2(2  -  x) 

0.372 

0.0838 

0.0063 

0 

X 

2  -  x 

0.414 

0.1720 

0 

0 

X 2 

0.481 

0.2481 

0 

0.0729 

2X3 

4x2  —  3x  +  1 

0.415 

0.0280 

0 

0.1741 

12.  SILENT  DUEL:  CONTINUOUS  FIRE 

If  a  duelist  is  permitted  to  vary  his  rate  of  fire  along  a  continuous  scale 
between  zero  and  one,  then  the  silent  duel  with  continuous  fire  has  the 
property  that  every  mixed  strategy  is  dominated  by  a  pure  strategy. 

Let  R(t)  be  the  rate  of  fire  and  A(t)  be  the  accuracy  density  function, 
i.e.,  the  probability  of  a  kill  by  a  unit  amount  of  fire  in  a  unit  time  at  time 
t.  Then  the  probability  of  the  opponent’s  survival  over  the  interval  (0,  t)  is 

P{R,  t)  =  exp  |  —  J*  A(u)R(u)  dwj, 

where 

0  <  R(t)  <  1,  £  R(t)  dt  =  0  <  1, 

and  13  is  the  total  amount  of  ammunition. 

If  S  is  the  set  of  all  strategies  R,  then  a  mixed  strategy  is  a  distribution 

function  F  where  js  dF(R )  =  1.  If  the  duelist  is  not  killed  by  time  t,  the 
probability  of  his  opponent’s  survival  is 

<t>(F,  t)  =  fs  exp  (  -  f‘  A(u)R(u)  du}  dF(R). 
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Corresponding  to  any  mixed  strategy  F ,  there  is  a  pure  strategy 

Rf  =  JSR  dF(R ) 

which  is  uniformly  better  than  F ,  independent  of  the  opponent  s  action. 


13.  TARGET  PREDICTION 

A  classic  military  problem  is  how  best  to  aim  at  a  mobile  target  which 
is  deliberately  maneuvering  so  as  to  confound  prediction  of  its  position. 
The  mobile  target  may  be  a  ship,  plane,  or  infantryman.  Their  attacker 
may  be  a  bomber,  an  antiaircraft  gun,  or  a  sniper,  respectively.  In  each 
case,  there  is  a  time  lag  between  the  detection  of  the  target  and  the  arrival 
of  the  projectile. 

Suppose  a  battleship  in  midocean  is  aware  of  an  enemy  bomber’s  pres¬ 
ence,  but  the  plane  is  too  high  for  the  battleship  to  take  any  offensive 
measures  against  the  plane.  However,  the  battleship  can  maneuver  in  order 
to  confound  the  prediction  of  its  position.  The  ship  is  interested  only  in 
not  being  hit.  The  plane  has  one  bomb  and  let  us  assume  that  the  bomber’s 
aim  is  excellent,  but  there  is  a  time  lag  between  the  release  of  the  bomb 
and  its  detonation.  That  is,  the  bomber  must  aim  at  an  anticipated  position 
of  the  ship. 

In  order  to  gain  insight  into  this  difficult  problem  it  is  necessary  to 
simplify  the  problem  further  by  assuming  the  ocean  to  be  one-dimensional 
and  discrete.  Let  us  assume  that  the  battleship  is  located  on  one  of  a  long 
row  of  points  and  that  at  each  unit  of  time  it  moves  either  one  unit  to  the 
left  or  one  unit  to  the  right.  Let  us  assume  that  the  time  lag  is  2  units  or, 
what  is  the  same  thing,  2  moves.  The  payoff  to  the  bomber  is  1  if  he  hits 
the  battleship,  and  zero  otherwise. 

A  strategy  for  the  battleship  will  depend  on  prior  moves.  Since  the 
battleship’s  course  more  than  two  moves  ago  is  known  to  his  opponent, 
it  is  reasonable  to  suppose  that  this  dependence  will  not  reach  very  far 
back.  Let  us  suppose  that  the  choice  depends  on  the  previous  move  only. 
Now  suppose  that  the  battleship  moves  in  the  opposite  direction  with 
probability  x  and  moves  in  the  same  direction  with  probability  l  —  x.  At 
the  end  of  2  moves  the  battleship  will  be  located  at  one  of  three  positions. 
The  probabilities  associated  with  these  three  positions  are 


M  = 


(1  -  *)2 
X 

m(l  —  x). 


From  the  description  of  the  game  it  follows  that  M  represents  the  expected 
payoff. 
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If  we  restrict  ourselves  to  these  strategies,  then  an  optimal  strategy  for 
the  battleship  is  a  choice  x  which  makes  the  maximum  of  these  three 
probabilities  a  minimum.  This  occurs  at 

x  —  (1  —  x)2, 

or  at 


It  can  be  shown  that  the  value  of  the  game  is  given  by  x*  =  (3  —  VB)  /2 
and  that  x*  is  also  the  optimal  strategy  for  the  battleship.  Further,  this 
optimal  strategy  is  unique.  On  the  other  hand,  it  can  be  shown  that  the 
bomber  does  not  possess  an  optimal  strategy.  However,  the  bomber  has 
an  e-optimal  strategy.  That  is,  for  any  «  >  0,  there  is  a  mixed  strategy  for 
the  bomber  which  assures  him  of  a  hit  with  probability  >  x*  —  e, 
but  no  strategy  guarantees  x*. 


TACTICAL 
AIR-WAR  GAME 


1.  INTRODUCTION 

The  problem  of  optimal  employment  of  tactical  air  forces  in  the  various 
theater  air  tasks,  like  many  other  military  questions,  can  be  analyzed  as 
a  multimove  game  between  two  sides.  In  the  formulation  of  the  game  we 
view  the  tactical  air  war  as  consisting  of  a  series  of  strikes,  which  are  the 
moves  of  the  game.  Each  move  is  an  allocation  of  resources  among 
various  tasks.  Among  the  usual  tasks  are  the  following: 

Counter  air.  These  operations  are  against  the  enemy’s  theater  air  base 
complex  and  organization  in  order  to  destroy  his  aircraft,  personnel, 
facilities,  etc. 

Air  defense.  These  represent  air-defense  operations  against  the  enemy’s 
counter-air  operations. 

Ground  support.  The  targets  for  ground-support  operations  are  concen¬ 
trations  of  enemy  troops  or  fortified  positions,  attacked  in  order  to  help 
the  ground  forces  in  the  battle  area.  This  is  accomplished  by  aerial  delivery 
of  fire  power  against  the  enemy  ground  targets.  We  also  include  inter¬ 
diction,  reconnaissance,  and  airlift  in  this  ground-support  task. 


2.  FORMULATION  OF  TACTICAL  GAME 

We  assume  that  the  tactical  air-war  game  consists  of  a  series  of  strikes 
or  moves.  Each  strike  consists  of  simultaneous  counter-air,  air-defense,  and 
ground-support  allocation  by  each  side.  Let  us  assume  that  at  the  start  of 
the  game  Blue  has  p  planes  and  Red  has  q  planes.  Let  us  look  at  a  strike  in 
the  game,  say  the  initial  strike.  Suppose  that  on  this  strike  Blue  dispatches 
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x  planes  on  counter-air  operations  and  u  planes  on  air-defense  operations, 
and  the  remaining  amount,  p  —  x  —  u  planes,  on  ground-support  opera¬ 
tions.  Similarly,  suppose  that  for  his  first  strike  Red  allocates  y  planes  to 
counter  air,  w  planes  to  air  defense,  and  the  remaining  number,  q  —  y  —  w 
planes,  to  support  his  ground  forces.  For  this  initial  strike  and  for  any 
future  strikes,  the  above  decisions  are  made  by  each  side  in  ignorance  of 
the  allocation  of  the  opposing  side.  It  is  assumed,  however,  that  each  side 
knows  the  number  p  and  q  of  planes  that  both  have. 

Since  Red  allocates  w  planes  to  air  defense  we  can  expect  a  reduction 
in  the  number  of  Blue’s  planes  that  get  through  to  counter-air  targets. 
The  number  of  interceptions  by  Red  will  be  proportional  to  w,  say  cw, 
unless  Blue’s  attacking  planes  are  saturated.  This  proportionality  con¬ 
stant,  or  defense  potential,  depends  on  the  planes’  characteristics  and 
flying  altitudes,  and  on  their  weapons’  characteristics.  Hence  the  number 
of  Blue  attacking  planes  that  penetrate  Red’s  defenses  is  x  —  cw  as  long 
as  cw  is  not  larger  than  x.  If  cw  is  larger  than  x,  no  Blue  aircraft  will 
penetrate.  Therefore  the  number  of  Blue  attacking  planes  that  penetrate 
Red’s  defenses  is  the  larger  of  the  two  numbers  x  —  cw  and  0,  or 

number  Blue  planes  penetrating  =  max  (0,  x  —  cw). 

The  objective  of  Blue’s  counter-air  operations  is  to  reduce  the  enemy’s 
air  force  by  dropping  bombs  on  certain  targets,  and  the  number  of  air¬ 
craft  destroyed  will  vary  with  the  number  of  attacking  planes  that  pene¬ 
trate  Red’s  defenses.  Increasing  the  number  of  Blue’s  penetrating  planes 
will  diminish  the  enemy’s  air  force.  If  we  assume  that  each  of  Blue’s 
penetrating  planes  can  destroy  b  planes  of  the  enemy,  then  Blue  could 
destroy  b  max  (0,  x  —  cw)  Red  planes,  if  Red  had  this  number  of  aircraft 
at  risk  at  the  time  of  Blue’s  counter-air  operations.  The  proportionality 
constant  b  depends  on  the  target  as  well  as  the  aircraft  characteristics. 

Suppose  that  during  this  initial  strike  Red’s  air  force  is  increased  by  s 
replacements.  Suppose  also  that  of  the  q  planes  that  Red  uses  on  his 
operations,  aq  of  them  survive  antiaircraft  fire  and  accidents.  Now  let  us 
assume  that  Red  has  s  +  aq  planes  at  risk  at  the  time  of  Blue’s  counter-air 
operations.  Then  we  have 

Red  planes  left  =  max  {0,  aq  +  s  -  min  [s  +  aq,  b  max  (0,  s  -  cw)]} 

(10.1)  —  max  (0,  max  [0,  aq  +  s  —  b  max  (0,  x  —  cw)]} 

=  max  [0,  aq  +  s  —  b  max  (0,  x  —  cw)]. 

In  exactly  the  same  manner  we  can  analyze  the  effect  of  the  initial 
strike  on  Blue’s  inventory.  At  the  end  of  the  initial  strike, 

(10.2)  Blue  planes  left  =  max  [0,  dp  +  r  -  e  max  (0,  y  -  fu)\, 
with  the  constants  d,  e,  f  defined  similarly  to  a,  b,  c,  respectively. 
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3.  PAYOFF  OF  TACTICAL  GAME 

Let  us  look  at  Blue’s  employment  of  theater  air  forces  during  the  cam¬ 
paign.  We  assume  that  his  objective  is  to  assist  the  ground  forces  in  the 
battle  area,  and  the  outcome  of  this  assistance  will  vary  with  the  number 
of  planes  he  allocates  to  ground-support  operations.  We  assume  that  it  is 
possible  to  construct  for  Blue  a  payoff  function,  giving  the  payoff  for  each 
strike  of  the  campaign,  in  terms  of  the  distance  advanced  by  the  ground 
forces  which  is  a  function  of  the  number  of  planes  allocated  to  ground 
support.  This  function  depends  heavily  on  the  characteristics  of  the 
ground-support  targets — i.e.,  on  the  degree  of  concentration  of  troops, 
vehicles,  and  materiel,  and  on  the  fortification  of  positions. 

Now  if  Blue’s  ground  forces  must  advance  while  being  subjected  to 
Red’s  ground-support  sorties,  Blue’s  yield  in  ground  support  is  reduced  in 
accordance  with  the  number  of  planes  allocated  by  Red  to  close-support 
missions.  If  q  —  y  —  w  is  the  function  that  measures  the  distance  gained 
by  Red’s  ground  forces,  then  the  net  advance  of  Blue’s  ground  forces,  if 
he  allocates  p  —  x  —  u  planes  to  ground  support  and  Red  allocates 
q  —  y  —  w  planes  to  ground  support,  can  be  written  as 

p  —  x  —  u  —  (q  —  y  —  w). 

The  foregoing  expression  represents  the  payoff  to  Blue  for  this  one 
period  or  one  strike.  The  payoff  for  the  entire  campaign  of  N  strikes  is  the 
sum  of  these  net  yields  for  each  of  the  N  strikes,  or 

N 

(10.3)  M  =  2,  [p  —  x  —  u  —  (q  —  y  —  u?)]. 

The  problem  faced  by  each  side  is  now  apparent.  For  example,  Blue 
would  like  to  allocate  a  large  number  of  planes  to  ground-support  missions 
and  thereby  increase  his  payoff  at  a  given  move,  yet  he  would  like  to  de¬ 
stroy  the  Red  air  force  by  means  of  counter-air  operations  in  order  to 
ensure  that  q  is  small,  or  zero,  for  subsequent  moves.  Further,  if  he  does 
not  provide  for  air  defense  he  may  suffer  severe  losses  to  his  own  air  force 
if  Red  elects  to  mount  a  large  counter-air  strike.  Each  player  has  to  take 
into  account  the  future  and  the  possibilities  open  to  his  opponent. 


4.  TWO  TASKS— COUNTER  AIR  AND  GROUND  SUPPORT 

Let  us  first  study  the  special  tactical  air  war  model  with  two  tasks, 
counter  air  and  ground  support,  and  omit  the  air-defense  task.  This  is 
equivalent  to  assuming  that  the  air-defense  potentials  c  and  /  are  zero. 
Equations  (10.1)  and  (10.2)  now  read 
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(10.4)  5i  =  max  (0,  aq  —  bx  +  s), 

(10.5)  pi  =  max  (0,  dp  —  ey  +  r), 
and  the  payoff  (10.3)  reduces  to 

(10.6)  M  =  2  \{p  -  x)  -  (q  -  y)]. 

It  will  be  convenient  to  number  the  moves  from  the  end  of  the  game — 
i.e.,  the  nth  move  means  n  moves  to  the  end  of  the  game.  The  tactical  air 
war  game  may  now  be  described  as  follows.  Let  pn  and  qn  be  the  forces 
possessed  by  Blue  and  Red,  respectively,  and  known  by  both  Blue  and  Red 
at  the  start  of  a  game  of  n  moves.  Suppose  that  at  the  initial  move  of  this 
n-move  game  Blue  allocates  xn  planes  to  counter  air  where  xn  <  pn,  and 
simultaneously  Red  allocates  yn  planes  to  counter  air  where  y„  <  qn.  Then 
the  forces  possessed  by  Blue  and  Red  for  the  ( n  —  l)-move  game  are 
given  by 

(10.7)  pn- i  =  max  (0,  dpn  —  eyn  +  r„), 

(10.8)  g„_i  =  max  (0,  aqn  —  bxn  +  s„). 

Choices  x„_i  and  yn-i  are  then  made  for  this  (n  —  l)-move  game.  This 
process  is  continued  for  the  duration  of  the  game  of  N  moves.  The  payoff 
(10.6)  now  becomes 

(10.9)  M  =  S  [(p»  -  Xn)  ~  (<Zn  -  2/n)]. 

n  =  1 

The  game  as  described  has  a  value  and  optimal  strategies.  We  may 
expect  the  value  of  the  game  to  depend  on  the  number  of  moves  and  the 
size  of  initial  forces.  Define  i»,-(p,-,  q%)  to  be  the  value  of  the  game  of  i  moves 
in  which  Blue  has  p,  forces  and  Red  has  qt  forces  at  the  start  of  this  f-move 
game.  In  order  to  obtain  the  optimal  initial  move  of  an  n-move  game, 
i.e.,  the  optimal  values  of  x„  and  yn,  it  is  sufficient  to  solve  the  game  whose 
payoff  is  given  by 

(10.10)  M„(xn,  y r.)  pn  Xn  (^n  y li)  Vji— 1  (Pn— 1,  q n-- 1) , 

where  p„_i  and  qn- 1  are  given  by  (10.7)  and  (10.8),  respectively.  Equation 
(10.10)  defines  the  payoff  in  a  game  of  where,  on  the  initial  move  of  the 
n-move  game,  Blue  chooses  xn  and  Red  chooses  yn,  and  then  both  players 
play  optimally  for  the  remaining  (n  —  1)  moves  of  the  game. 


5.  OPTIMAL  TACTICS  FOR  TWO  TASKS 

From  (10.10)  it  is  clear  that  in  order  to  solve  the  n-move  game,  we 
first  need  to  solve  the  (n  —  l)-move  game,  which  in  turn  requires  a  solution 
of  the  (n  —  2)-move  game,  etc.  Therefore  we  shall  solve  the  n-move  game 
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by  starting  at  the  last  move  of  the  game  and  working  backwards  to  the 
first  move. 

Let  v0(po,  go)  =  0.  Then  from  (10.10)  we  have  a  one-move  game  or 
n  =  1,  i,. 

(10.11)  Mi(xh  yi)  =  pi  -  Xi  -  qi  +  y\. 

Clearly,  the  optimal  values  of  xx  and  y\  are  x*  =  0,  y*  —  0.  Blue  and  Red 
should  allocate  no  resources  to  counter  air.  That  is,  the  optimal  tactics  for 
Blue  and  Red  on  the  last  move  are  to  allocate  all  their  resources,  px  and  gi, 
respectively,  to  ground  support.  The  value  of  the  one-move  game  is 
given  by 

(10.12)  vx(ph  qi )  =  pi  —  Qi. 

Having  solved  the  one-move  game,  we  now  proceed  to  solve  the 
two-move  game.  The  payoff  for  n  —  2  is  given  by 

(10.13)  M2{x 2,  y2)  =  Pi  —  Xi  —  g2  +  2/2  +  Vi(Ph  Qi) 

=  P2  -  x2  -  g2  +  y2  +  Pi  —  gi, 

where  px  and  gi  satisfy  the  following  equations: 

(10.14)  pi  =  max  (0,  dp2  —  ey2  +  r2), 

(10.15)  gi  =  max  (0,  aq2  —  bx2  +  s2). 

Substituting  these  values  of  pi  and  qx  into  (10.13),  we  obtain 


(10.16) 


M2(x2,y2)  = 


P2  —  q2  —  X2  +  y2 

(1  +  d)p2  —  q2  —  x2 
+  1/2(1  -  e)  +  r2 

(1  +  d)p2  -  (1  +  a)q2 
—  (1  -  b)x 2  +  (1  -  e)y2 
+  r2  —  s2 


p2  -  (1  +  a)q2 
.  —  (1  -  b)x 2  +  y2  -  s2 


if  x2  > 
if  x2  > 

if  x2  < 

if  x2  < 


aq2  +  s2 
b  ’ 

aq2  ~b  s 2 


ag2_+_S2, 


aq2  +  s2 
b 


y 2  > 
2/2  < 

2/2  < 

2/2  > 


dp2  +  r2 
e 

dp2  +  r2 
e 

dp2  +  r2 
e 

dy2  +  r2 
e 


The  solution  of  the  above  game  whose  payoff  M2(x 2,  y2)  is  defined  in 
(10.16)  will  depend  on  the  values  of  the  parameters  a,  b,  d,  and  e.  First, 
suppose  that  b  >  1  and  e  >  1.  Then  the  optimal  tactics  are  easily  com¬ 
puted,  and  are  given  by 

o(72  +  s2  *  dp2  +  r2 
x*2  =  -X-j—  -  2/2  = 

The  value  of  the  game  is 


(10.17)  v2=  (l  +  ~^p2  -  (l  +  “)g2  +  g  - 
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If  b  <  1  and  e  <  1  in  (10.16),  then  the  optimal  tactics  are 

z?  =  0,  yt  =  0. 

In  this  case,  the  value  of  the  game  is 

(10.18)  v2  =  (1  -f-  d)p2  —  (1  +  a)q2  +  Ti  — 

The  optimal  tactics,  x2  and  y2,  tell  Blue  and  Red  how  they  should 
allocate  their  forces  among  the  two  tasks  on  the  next  to  the  last'  strike. 
Of  course,  on  the  last  strike  of  the  campaign  we  have  already  shown  that 
Blue  and  Red  should  allocate  all  their  forces  to  ground  support. 

Having  computed  the  optimal  tactics  for  n  —  1  and  n  =  2,  we  next 
proceed  to  compute  the  optimal  tactics  for  n  =  3  (three  moves  to  the  end 
of  the  game).  We  set  n  =  3  in  (10.9),  which  gives  us  the  payoff  function 

(10.19)  M3(x3,  y3)  =  pz  -  x3  —  q3  +  y3  +  v2(p2,  q2), 
where 

p2  =  max  (0,  dp3  —  ey3  +  r3), 

.  q-i  =  max  (0,  aq3  —  bx3  +  s3), 

and  v2(pi,  q2)  is  given  by  (10.17)  or  (10.18),  depending  upon  the  parameters 
b  and  e.  Solving  the  game  whose  payoff  function  is  defined  by  (10.19),  we 
can  obtain  the  optimal  tactics  for  a  three-move  game  and  the  game  value 
v3(p3,  q3). 

Proceeding  in  the  above  manner,  we  build  up  the  solution  of  the  game 
for  any  number  of  moves.  It  turns  out  that,  independent  of  the  parameters, 
initial  conditions,  and  size  of  forces  at  a  given  move,  both  sides  have  op¬ 
timal  pure  strategies.  Although  every  strike  by  a  player  is  made  simul¬ 
taneously  with  his  opponent,  nevertheless,  a  player  never  needs  to  random¬ 
ize.  Thus  an  optimal  strategy  for  a  player  can  be  specified  by  giving,  for 
each  strike  of  the  campaign,  either  the  number  of  planes  he  allocates  to 
counter-air  operations  or  the  number  of  planes  he  allocates  to  ground  sup¬ 
port.  These  optimal  allocations  depend  on  the  attrition  parameters  a,  b, 
d,  e,  and  on  the  number  of  strikes  remaining  in  the  campaign. 

Before  we  describe  the  optimal  strategies  obtained  by  the  above  pro¬ 
cedure,  let  us  introduce  some  notation.  From  Equation  (10.17)  it  is  clear 
that  if  Red  had  enough  forces  he  could  annihilate  Blue’s  forces  by  allocating 

dpn  +  r„ 

2/„=  e 

to  counter-air  operations.  Any  allocation  exceeding  this  is  clearly  wasteful. 
Thus  it  is  reasonable  to  call  an  allocation 


yn  =  min  (qn,  dj>n  + 
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a  concentration  on  counter  air.  We  shall  denote  such  an  allocation  by  A. 
Similarly  the  allocation  by  Blue  of 

■  l  aqn  +  s„\ 
x„  =  mm  I  p„, - r - 1 

is  a  concentration  on  counter  air  by  Blue  and  will  also  be  denoted  by  A. 

If  Blue’s  allocation  to  counter  air  is  x„  —  0,  and  therefore  all  forces  p„ 
are  allocated  to  ground  support,  we  shall  denote  this  tactic  by  G.  Similarly, 
if  yn  =  0,  then  Red  concentrates  on  ground  support,  and  this  is  denoted 
by  G. 

Finally,  consider  an  allocation  which  is  neither  A  nor  G.  We  shall 
denote  this  by  the  symbol  (A,  G).  In  such  a  tactic  a  player  splits  his 
forces  between  counter  air  and  ground  support,  and  does  not  concentrate 
on  either  task. 

The  optimal  strategies  depend  on  the  attrition  parameters.  Let  us  first 
suppose  that 

a  +  b  >  1  and  d  +  e  >  1. 

Then  it  turns  out  that  the  optimal  strategy  for  each  player  requires  him 
to  begin  the  campaign  with  a  series  of  allocations  A,  and  to  end  with  a 
series  of  allocations  G.  The  points  at  which  the  players  shift  from  A  to  G 
will,  in  general,  be  different  for  Red  and  Blue. 

The  precise  points  of  shift  depend  on  the  magnitudes  of  the  attrition 
parameters,  in  the  following  manner.  Let  /  be  the  largest  integer  for  which 
the  inequality 


holds,  and  let  g  be  the  largest  integer  for  which  the  inequality 


1 

b 


1  —  aa 
1  —  a 


>  0 


holds.  These  integers/  and  g  determine  the  strike  at  which  the  shift  is  made. 

The  optimal  strategies  obtained  by  the  method  described  above  are 
summarized  in  Table  4.  In  this  table,  the  integer  t  denotes  another  shift 
point  which  is  applicable  in  cases  2  and  3. 


6.  OPTIMAL  TACTICS  FOR  THREE  TASKS 

We  now  return  to  the  more  general  model  with  all  three  tasks — counter 
air,  air  defense,  and  ground  support — present.  We  shall  see  that  increasing 
the  number  of  air  tasks  to  three  leads  to  substantial  changes  in  the 
character  of  the  optimal  tactics. 

In  order  to  simplify  the  analysis  we  assume  that  Blue  and  Red  have  the 
same  air-defense  potential:  each  plane  allocated  to  defense  can  prevent  one 


Table  4.  Optimal  Allocation  of  Forces  Between  Two  Tasks 


Case 

No. 

Range  of  parameters 

Player 

Optimal  allocation  at  move  number  n 
{counting  from  end  of  campaign) 

1  <  n  <  min  (J  +  1,  g  +  1) 

min  (/  +  2,  <7  +  2)  <  n  <  t 

n  =  t  +  1 

t  +  2<n<N 

1 

a  +  6  >  1,  d  +  e  >  1 

Blue 

G 

G 

A 

A 

f  <0 

Red 

G 

A 

A 

A 

Blue 

G 

A 

A 

A 

9  <J 

Red 

G 

G 

A 

A 

Blue 

G 

A 

A 

A 

g  -  g 

Red 

G 

A 

A 

A 

2 

Gt  ~b  6  ^  1,  d  6  ^  \ 

Blue 

G 

G 

A 

{A,  G) 

(g  =  00 ) 

Red 

G 

A 

A 

A 

3 

a  -j-  b  >  1,  d  -|-  e  <  1 

Blue 

G 

A 

A 

A 

(/  =  °° ) 

Red 

G 

G 

A 

(A,  G ) 

4 

a  +  6  ^  1,  d  -f-  e  ^1 

Blue 

G 

G 

G 

G 

(/  =  <?  =  «) 

Red 

G 

G 

G 

G 
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attacking  plane  from  reaching  the  target — that  is,  we  assume  that  c  =  /  =  1. 
We  also  assume  that  each  attacking  plane  that  penetrates  the  defense 
can  destroy  one  plane  in  an  airfield  strike,  or  b  =  e  =  1,  and  that  losses 
due  to  aborts,  accidents,  and  antiaircraft  fire  are  negligible,  or  a  =  d  —  1. 
Finally,  we  assume  that  replacements  are  absent,  i.e.,  r  =  s  =  0.  Then  the 
inventory  of  planes  at  the  end  of  a  strike  will  be,  for  Blue  and  Red, 
respectively, 

pi  —  max  [0,  p  —  max  (0,  y  —  m)], 
qi  =  max  [0,  q  —  max  (0,  x  —  ip)]. 

If  we  derive  the  optimal  strategies  in  the  three-task  model  we  find  that 
they  are  different  from  the  two-task  model  in  the  following  two  important 
ways:  first,  the  optimal  tactics  depend  upon  the  relative  strengths  of  the 
two  sides;  second,  optimal  play  requires  one  player  to  use  a  mixed  strategy. 

The  techniques  used  to  derive  the  optimal  tactics  are  similar  to  those 
used  for  the  two-task  model,  but  are  more  complicated.  We  shall  omit  the 
derivation ;  however,  the  reader  can  verify  the  given  results.  We  shall  give 
a  complete  description  of  the  optimal  employment  of  forces  in  terms  of  the 
number  of  strikes  remaining  and  the  relative  strengths  of  the  two  sides. 
In  describing  the  optimal  allocations  we  shall  always  assume  that  at  the 
move  in  question  Blue  is  the  stronger  side,  that  is  to  say,  p  >  q.  This  is 
merely  a  convention  to  facilitate  the  description  of  optimal  tactics,  and  is 
not  meant  to  imply  that  the  side  which  is  the  stronger  at  a  given  stage  of 
the  game  will  always  remain  the  stronger  for  all  subsequent  moves.  Of 
course,  if  a  player  is  initially  stronger  and  plays  optimally,  then  he  will 
remain  the  stronger  throughout  the  game. 

First,  we  give  a  qualitative  description  of  the  optimal  strategies: 

1.  Campaign  ends  with  ground  support.  The  campaign  always  ends  with 
a  series  of  strikes  on  ground  support— i.e.,  during  the  closing  period  of  the 
campaign  both  Red  and  Blue  concentrate  all  their  forces  on  ground-support 
missions.  In  this  terminal  period  both  sides  have  the  same  optimal  tactics, 
regardless  of  their  resources.  If  we  assume  that  c=f=b  =  e—  1,  then 
this  terminal  period  consists  of  only  the  last  two  strikes  of  the  campaign. 

2.  Blue  ( stronger )  splits  his  forces.  At  all  times  other  than  the  closing 
phase  of  the  campaign,  Red  and  Blue  have  very  different  optimal  tactics. 
During  any  of  these  early  strikes,  the  stronger  side,  say  Blue,  has  a  pure 
strategy,  i.e.,  there  exists  a  best  allocation  of  Blue’s  air  force  among  the 
three  air  tasks.  In  this  connection,  there  is  a  critical  value  (about  2.7)  of 
the  ratio  of  the  Blue  force  to  the  Red  force  which  governs  Blue’s  allocation 
during  the  early  period  in  the  following  manner:  If  the  force  ratio  is  less 
than  this  critical  value,  then  the  optimal  allocation  in  the  early  period  con¬ 
sists  of  splitting  the  stronger  force  between  two  tasks,  counter  air  and  air 
defense,  and  neglecting  the  ground-support  task.  The  size  of  the  split  de- 
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pends  on  the  relative  strengths  of  the  two  forces  and  the  number  of  strikes 
left  in  the  campaign.  However,  if  Blue’s  strength  relative  to  Red’s  is 
greater  than  the  critical  value,  then  regardless  of  his  strength,  Blue  should 
divide  his  force  in  a  fixed  way  among  the  three  tasks,  counter  air,  air 
defense,  and  ground  support.  The  number  of  aircraft  allocated  to  each 
mission,  however,  is  still  dependent  on  the  number  of  strikes  remaining. 

3.  Red  ( weaker )  mixes  his  tactics  and  concentrates  his  forces.  The  weaker 
combatant  cannot  use  a  single  strategy,  but  must  bluff  during  all  the  strikes 
other  than  those  of  the  terminal  phase.  Unlike  his  opponent,  the  weaker 
combatant  does  not  have  a  single  allocation  that  is  best.  He  must  use  a 
mixed  strategy  and  gamble  for  high  payoffs.  If  he  is  not  too  weak  -i.e.,  if 
the  force  ratio  is  less  than  the  critical  value — then  he  concentrates  his  en¬ 
tire  force  either  on  counter  air  or  on  air  defense;  but  which  of  these  tasks 
receives  the  full  effort  is  decided  by  some  chance  device.  However,  if  Red 
is  very  weak  (force  ratio  larger  than  the  critical  value),  then  he  allocates 
his  entire  air  force  to  any  one  of  the  three  air  tasks  with  the  particular  task 
again  chosen  at  random.  In  other  words,  if  a  player  is  very  weak  relative 
to  the  opponent,  then  he  takes  a  chance  on  an  early  payoff.  Of  course,  to 
be  most  effective,  he  must  bluff  correctly— i.e.,  the  random  device  should 
select  the  tasks  with  the  proper  relative  frequencies. 

4.  Mix  and  split  the  same  tasks.  It  is  of  interest  to  note  that  on  each 
strike  Red,  the  weaker  side,  bluffs  with  the  same  tasks  that  Blue  uses  in 
his  allocation.  Thus  if  Red  is  very  weak  he  bluffs  with  each  of  the  three 
tasks.  However,  if  Red  is  moderately  weak,  he  bluffs  with  two  tasks— 
counter  air  or  air  defense — and  Blue  splits  his  forces  between  the  same 
two  tasks. 

5.  Blue’s  defense  decreases  during  campaign.  As  noted  above,  prior  to 
the  closing  phase  of  the  campaign  Blue  splits  his  forces  among  his  air  tasks. 
The  actual  split  is  a  function  of  the  force  sizes  of  Blue  and  Red  and  the 
number  of  strikes  left  in  the  campaign.  However,  as  the  campaign  proceeds, 
the  fraction  of  Blue’s  force  allocated  to  air  defense  will  decrease.  At  the 
same  time,  the  fraction  allocated  by  Blue  to  counter  air  will  increase. 
During  this  time,  the  chance  that  Red  will  attack  Blue  also  decreases,  but 
the  chance  that  Red  will  defend  himself  increases. 

6.  Blue’s  defense  in  a  long  campaign.  In  the  early  stages  of  a  relatively 
long  campaign,  the  stronger  side  defends  itself  against  a  concentrated  at¬ 
tack  by  the  weaker  side.  During  this  period,  Blue  dispatches  on  air  defense 
a  force  of  planes  approximately  the  size  of  Red’s  entire  force.  You  will 
recall  that  a  particular  value  for  the  air-defense  effectiveness  was  assumed. 

Table  5  summarizes  the  optimal  tactics  for  campaigns  consisting  of  at 
most  eight  strikes.  The  tabulation  gives  the  optimal  allocation  for  each 
strike  (where  the  strike  number  is  defined  by  the  number  of  strikes  remain¬ 
ing  in  the  campaign)  as  a  function  of  the  relative  sizes  of  the  forces  at  the 


Table  5.  Optimal  Allocation  op  Forces  Among  Three  Tasks 


(Strong  side  having  p  forces  and  weak  side  having  q  forces,  p  >  q) 


Duration  of 
campaign 
{no.  of  strikes 
remaining  in 
campaign) 

Relative  initial 
strengths  of 
opponents  {ratio 
of  strong  side 
to  weak  side) 

p/q 

Optimal  initial  allocation  by  strong  side 
{force  size  allocated  to) 

Optimal  initial  allocation  by  weak  side 
{probability  of  concentrating  forces  on) 

Value  of  game 

Counter 

Air 

Air 

Defense 

Ground 

Support 

Counter 

Air 

Air 

Defense 

Ground 

Support 

i 

1.00  to  oo 

0 

0 

V 

0 

i 

P  -  Q 

2 

1.00  to  oo 

0 

0 

V 

0 

0 

i 

2  (p  -  g) 

3 

1.00  to  2.00 

Q 

0 

p  -  q 

0.50 

0.50 

■|;:i 

3(p  —  g) 

2.00  to  «> 

1.5  q 

0.5g 

p  -  2g 

0.50 

u 

3(p  -  g) 

4 

1.00  to  2.33 

0.5  v  +  0.5g 

0.5p  —  0.5g 

0.50 

0.50 

4.5  (p  -  g) 

2.33  to  "o 

1.67g 

0.67g 

p  -  2.33 

0.33 

0.33 

■  1 

4.00p  -  3.33g 

1.00  to  1.1 

0.41p  +  0.59g 

0.59p  —  0.59g 

0 

0.53 

0.47 

0 

6.35p  -  6.35g 

5 

1.70  to  2.45 

0.55p  +  0.30g 

0.45p  —  0.36g 

0 

0.45 

0 

5.82  p  —  5.45? 

2.45  to  oo 

1.70g 

0.75g 

p  -  2.45 

0.25 

0.45 

5.00p  —  3.45g 

1.00  to  1.44 

0.32p  +  0.68g 

0.68p  -  0.68g 

0 

0.56 

0.44 

0 

8.39p  -  8.39g 

6 

1.44  to  1.78 

0.40p  +  0.56g 

0.60p  —  0.56  q 

0 

0.52 

0.48 

0 

8.00  p  —  7.83  q 

1.78  to  2.51 

0.59p  +  0.22g 

0.41p  -  0.22g 

0 

0.41 

0.59 

0 

7.04 p  -  6.12g 

2.51  to  oo 

1.71g 

0.80g 

p  -  2.51 

0.20 

0.29 

0.51 

6.00p  —  3.51? 

1.00  to  1.29 

0.25p  +  0.75g 

0.75 p  -  0.75g 

0 

0.58 

0.42 

0 

10.50p  -  10.50g 

1.29  to  1.53 

0.29p  +  0.70g 

0.71p  -  0.70g 

0 

0.57 

0.43 

0 

10.26p  -  10.19g 

7 

1.53  to  1.84 

0.41p  +  0.51g 

0.59p  -  0.51g 

0 

0.50 

0.50 

0 

9.54p  -  9.09g 

1.84  to  2.55 

0.63p  +  O.llg 

0.37p  -  O.llg 

0 

0.37 

0.63 

0 

8.21p  -  6.64g 

2.55  to  oo 

1.72g 

0.84g 

p  -  2.55 

0.17 

0.28 

0.55 

7.00p  -  3.55g 

0.20p  +  0.80g 

0.80p  —  0.80? 

0 

0.60 

0.40 

0 

12.60p  -  12.60g 

0.22p  +  0.78g 

0.78p  -  0.78g 

0 

0.59 

0.41 

0 

12.48p  -  12.45g 

8 

1.40  to  1.59 

0.28p  +  0.69g 

0.72p  -  0.69g 

0 

0.56 

0.44 

0 

12.06p  -  11.86g 

1.59  to  1.88 

0.42p  +  0.46g 

0.58p  -  0.46g 

0 

0.49 

0.51 

0 

11.03p  -  10.22g 

1.88  to  2.58 

0.66p  +  0.01? 

0.34p  —  0.01? 

0 

0.34 

0.66 

0 

9.36p  -  7.07g 

2.58  to  oo 

1.72g 

0.86g 

p  -  2.58 

0.14 

0.28 

0.58 

8.00p  -  3.58g 
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time  of  that  strike.  However,  the  value  of  the  game,  which  is  given  in  the 
last  column  of  the  tabulation,  is  for  the  campaign  of  given  duration.  Using 
the  definitions  of  optimal  strategy,  one  can  verify  the  values  given  in  the 
table. 


INFINITE  GAMES  WITH 
SEPARABLE  PAYOFF  FUNCTIONS 


1.  INTRODUCTION 

In  Chapter  3  we  described  a  general  method  or  procedure  for  solving 
finite  games.  Thus,  given  a  game  with  a  finite  number  of  strategies,  we 
can  compute  all  the  optimal  strategies  of  Blue  and  Red.  Since  the  method 
requires  inverting  a  finite  number  of  submatrices,  it  is  a  finite  process. 

No  general  method  of  solution  exists  for  infinite  games.  That  is,  given 
an  arbitrary  game  with  an  infinite  number  of  strategies,  we  do  not  have  a 
finite  process  for  obtaining  the  optimal  strategies  of  Blue  and  Red.  How¬ 
ever,  there  are  some  special  methods  applicable  to  special  games  classified 
according  to  the  form  of  the  payoff  function.  We  have  already  described 
in  Chapter  7  a  method  of  solution  for  a  large  class  of  games,  those  games 
whose  payoff  is  a  convex  function  of  Red’s  strategies.  We  shall  now  discuss 
another  large  class  of  games,  those  whose  payoff  function  is  a  separable 
function  of  Blue’s  and  Red’s  strategies. 

2.  DEFINITION 

Suppose  Blue  chooses  a  strategy  x  from  the  interval  0  <  x  <  1  and 
Red  chooses  a  strategy  y  from  the  interval  0  <  y  <  1.  If  the  payoff 
function  has  the  form 

n  m 

M(x,  y)  =  S  2  ai,ri(x)Sj(y) 

j=i »= i 

where  the  functions  r,  and  s,  are  continuous,  then  the  game  is  called  a 
separable  game. 

Suppose  that  n  <  m.  Then  by  rewriting  the  payoff  function  as 
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n  r  m  “1 

M(x,y)  =  2  2  aijrAx)  \Sj(y), 

j = i  L*  =  i  J 

we  can  represent  the  payoff  function  as  a  single  sum 

M(x,y )  =  2  r'j{x)Sj(y), 

}=1 

where  r'j{x)  =  2  atJr,(x).  However,  we  shall  use  the  double-sum  form  of 
1  =  1 

of  the  payoff  function  since  it  provides  greater  mathematical  flexibility. 

3.  MOMENTS 

A  mixed  strategy  for  Blue  may  be  represented  by  a  cumulative  prob¬ 
ability  distribution  function  F(x).  ,A  mixed  strategy  for  Red  is  a  cumu¬ 
lative  probability  distribution  function,  G(y).  The  expectation,  or  mixed 
strategy  payoff,  for  a  separable  game  is  then  given  by 

<t>{F,  (?)  =  JQl  £  M(x,  y )  (lF{x)  dG(y) 


n  m  r\  r  l 

=  22  L  ri(x)dF( x)  Sj(y)dG(y). 

3=  1  •'  =  1  J° 

Let 


r,  =  Jo  n{x)  dF(x), 

*  =  1,  2,  •  ■ 

•  ,m, 

and 

s,  =  /0X  sAy)  dG(y), 

j  =  1.  2, .  . 

.,n. 

Then  to 

each  distribution  function  F(x)  there 

corresponds 

a  vector 

r  =  (n,  r2, ,  rm).  To  each  distribution  function  G(y)  there  corresponds 
a  vector  s  =  (si,  s2,  .  .  . ,  s„).  We  shall  refer  to  the  vector  r  as  the  m 
moments  of  F{x).  Similarly  the  vector  s  represents  the  n  moments  of  G{y). 

In  terms  of  the  components  of  these  vectors  the  mixed  strategy  payoff 
is  the  bilinear  form 

n  m 

E(Tj  s)  =  2  2  GE  ijTiSj. 

1 i=l 

4.  EQUIVALENCE  OF  Fix)  AND  POINTS  OF  CONVEX  SET  R 

Letting  F(x)  vary  over  all  distribution  functions,  we  obtain  a  set  R  of 
points  r  =  (ri,  r2,  •  ■  ■  ,  rm).  We  shall  now  show  that  this  set  is  the  convex 
set  D  spanned  by  the  curve  C  traced  out  in  rn  dimensions  by 

r<  =  r,(x),  i  —  1,  2, ... ,  m} 

as  x  varies  between  0  and  1. 
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Consider  the  convex  set  D  spanned  by  C.  Suppose  r°  =  (r?,  .  .  .  ,  r°m) 
is  a  point  of  R  not  in  D  and  let  F°(x)  be  a  distribution  function  which 
yields  r°,  or 

r?  =  n(x)  dF°(x )  i  =  1,  2, .  .  . ,  m. 

Then  there  exists  some  hyperplane  separating  r°  from  D.  That  is,  for 
some  fixed  5  >  0, 

m  m 

2  CiT?  -  2  CiTi(x)  >  S 

t=l  1=1 

for  all  x  in  0  <  x  <  1. 

Integrating  both  sides  with  respect  to  dF°(x),  we  obtain 

m  ri  m  r  i  /m 

2  CiTi  f  dF\x)  -  2  C,  f  r,(x)  dF\x)  >  5  dF\x), 

t  =  l  JO  1  =  1  jo  JO 

m  m 

or  2  CiT?  —  2  CiT?  >  S, 

1=1  i=l 

giving  the  contradiction  0  >  0.  This  proves  that  all  points  of  R  are  in  D. 

Conversely,  we  shall  show  that  every  point  in  D  is  in  R.  Suppose  r°  is 
some  point  in  D.  That  is,  the  components  of  r°  may  be  represented  as 

m 

Ti  =  2  akri(xk),  i  =  1,  2, .  .  . ,  m, 

k=i 


m 

where  a*  >  0  and  2  ak  =  1.  It  is  easily  seen  that  the  distribution 

4  =  1 

function 

m 

F°(x)  =  2  aJxt(x) 

4  =  1 

yields  the  point  r°.  Hence  every  point  of  D  is  in  R. 

In  a  similar  manner  we  can  show  that  if  wre  allow  G(y)  to  vary  over  all 
distribution  functions  we  obtain  a  set  S  of  points  s  =  (si,  S2, ,  s„)  which 
is  identical  with  the  convex  set  spanned  by  the  curve  C'  traced  out  in  n 
dimensions  by 

s>  =  s,(y),  j  =  1,2, ...  ,n, 

as  y  varies  between  0  and  1. 

5.  BILINEAR  GAME  OVER  A  CONVEX  SET 

We  have  shown  that  if  an  infinite  game  has  a  separable  payoff  function 
every  mixed  strategy  F(x)  corresponds  to  some  point  r  in  a  convex  set  R. 
Conversely,  every  point  in  R  corresponds  to  at  least  one  mixed  strategy 
F(x).  Therefore,  the  selection  of  a  mixed  strategy  is  equivalent  to  the 
selection  of  a  point  in  a  convex  set  R  in  m-dimensional  space. 
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A  separable  game  having  a  payoff  function 

n  to 

M (x,  y)  =  2  2  a.yr,(:r)s.,(t/) 

J-l i=l 

is  thus  equivalent  to  the  following  game:  Blue  chooses  a  point 
r  =  (n, .  .  .  ,rm)  of  a  convex  set  R  lying  in  Euclidean  m-space.  Red 
chooses  a  point  s  =  (si,  .  .  . ,  sn)  of  a  convex  set  S  in  Euclidean  n-space. 
The  payoff  from  Red  to  Blue  is  given  by 

n  to 

E(r,  s)  —  2  2  c  ,,r  1-Sy. 

y=i t=i 

Further,  the  set  R  is  the  convex  set  spanned  by  the  curve  defined  by 

ri  —  riix),  i  =  1,2,  ...  ,m, 

as  x  varies  between  0  and  1.  The  set  S  is  the  convex  set  spanned  by  the 
curve  defined  by 

Sj  =  Sj(y),  j  1>  2 ,  .  .  .  ,  n, 

as  y  varies  between  0  and  1. 

Optimal  strategies  of  the  two  players  are  given  by  points  r°,  s°  such  that 
min  E(r°,  s)  =  max  E(r,  s° )  =  v, 

sbS  tbR 

where  v  is  the  value  of  the  game.  From  the  minimax  theorem  for  bilinear 
forms  over  convex  sets,  it  follows  that  there  exist  optimal  strategies  for  the 
players.  Of  course,  the  existence  of  optimal  strategies  F*(x),  G*{y)  for  the 
original  game  in  terms  of  the  separable  payoff  function  was  guaranteed  by 
the  fact  that  r,(x)  and  Sj(y)  are  continuous. 

6.  DISTRIBUTION  FUNCTION  F(x)  AND  POINTS  OF  CONVEX  SET  R 

We  have  seen  that  to  each  distribution  function  F{x)  there  corresponds 
some  point  r  of  R,  and  conversely.  Every  point  on  the  curve  C  has  for 
its  coordinates 

n  =  r;(f),  i  =  1,  2, .  .  .  ,  m;  0  <  t  <  1. 

Now  every  one-step  distribution  function  F(x)  =  It(x),  where  0  <  t  <  1 
corresponds  to  a  point  r  of  R  such  that 

ft  =  rt(x)  dlt(x)  =  r<(f)  i  =  1,2, ...  ,m. 

Therefore  points  on  C  correspond  to  one-step  distribution  functions. 

Every  point  in  R  can  be  represented  as  a  convex  combination  of  at 
most  m  points  of  C.  It  follows  from  above  that  every  point  of  R  corresponds 
to  a  step-function  having  at  most  m  steps. 
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7.  NUMBER  OF  STEPS  IN  STEP-FUNCTION  SOLUTION  OF  GAME 

Every  separable  payoff  function  may  be  expressed  as 

n  /  m  \ 

M(x,y)  —  2(2  CLijTiix )  )Sj(y) 

=  2  r'j{x)s1(y) 

j  =  l 

where  n  <  m.  This  is  a  bilinear  game  played  over  the  convex  sets  R' ,  S 
determined  by  the  curves  r'  =  r'j(x)  and  s,-  =  s,(y),  respectively.  Since  the 
dimensions  of  both  R  and  S  are  n,  R'  and  S  are  determined  by  step-func¬ 
tions  with  at  most  n  steps.  Therefore,  each  player  has  an  optimal  mixed 
strategy  with  at  most  min  ( m ,  n)  steps. 

Suppose  the  payoff  function  is  further  generalized  and  is  given  by 

«  m 

M {x,  y)  =  2  2  ai,ri(x)sj(y), 

1 i=l 

and  the  convergence  is  uniform  in  y.  Then  the  functions 

00 

St(y)  =  2  classy),  i  =  1,2, ...  ,m, 

3“  1 

are  continuous.  The  payoff  may  therefore  be  rewritten 

m 

M{x,  y)  =  2  ri(x)s'i(y) 

t  =  l 

From  this  it  follows  again  that  each  player  has  an  optimal  mixed  strategy 
with  at  most  m  steps. 

8.  SOLUTION  OF  SEPARABLE  GAMES 

A  solution  of  a  separable  game,  or  a  pair  of  optimal  strategies  of  the 
two  players,  is  a  pair  of  points  r*  of  R  and  s*  of  S  such  that 

m,n  m,n 

min  2  =  max  2  a.-yTiS*  =  v, 

8tS  i,j  =  1  rzR  i,j  =  1 

where  and  Sj  are  the  components  of  r,  s,  respectively.  We  shall  show 
how  to  find  r*,  s*. 

The  payoff  function  for  a  separable  game  may  be  written  as  follows: 

n  /  m  \  m  /  n  \ 

E(r,  s)  =2(2  dipt  k  =  2(2  ofj-s>  )r„ 

y  =  l  \i  =  l  /  i  =  i  \,  =  i  ) 
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If  wc  let 

m 

fj(r)  —  S  dijTi  j  —  1)  2, .  .  .  ,  fly 

i  —  l 

Qi  (s)  —  S  dijSj  i  1,  2,  ...  j  771 } 

j  =  l 

we  have 

n  m 

E{r}  s)  =  2  =  2 

j  =  1  i  =  1 

The  planes  /,*(r)  =  0  can  be  considered  to  divide  the  space  R  into  a  finite 

number  of  portions  Rh  .  .  .  ,  Ri,  .  .  .  ,  Ri-  For  each  point  r°  of  R{  consider 

the  set  of  points  S(r°)  of  S  where  min  S  //r°)s;  is  assumed.  This  is  a  convex 

<s 

set  generally  lying  on  the  boundary  of  S.  Similarly,  the  planes  gt(s)  =  0 
divide  the  space  S  into  a  finite  number  of  portions  <Sl,  .  .  .  ,  Sj,  .  .  . ,  SP. 
Consider,  for  each  s°  of  S,  the  set  of  points  R(s°)  where  max  2  gr,(a:)r,-  is 

reached.  If  a  point  r°  has  the  property  that  S(r°)  contains  a  point  s°  for 
which  R(s°)  contains  r°,  then  r°  is  an  optimal  strategy  for  Blue.  A  similar 
analysis  can  be  applied  to  Red. 

By  considering  each  region  Ri  of  R\,  Ri,  .  .  .  ,  Ri  and  using  the  above 
mapping  of  R  into  the  space  S  and  then  mapping  the  regions  of  S  into  R, 
we  can  obtain  all  solutions  of  both  players  as  the  fixed  points  in  these 
mappings. 

Example.  Consider  the  following  separable  payoff  function: 

M(x,  y )  =  y2(^ cos  |  x  +  sin  |  x  -  1^  +  ^cos  2  x  ~  3  sin  2  X) 


This  function  does  not  have  a  saddle-point,  hence  the  game  has  a  mixed 
strategy  solution.  Now  the  choice  of  a  mixed  strategy  Fix)  by  Blue  is 
equivalent  to  his  choosing  a  point  r  =  (n,  r 2)  in  a  set  R  which  is  the  convex 
set  spanned  by  the  curve 

ri  =  sin  ^  x,  0  <  x  <  1, 

r2  =  cos  ^  x,  0  <  x  <  1. 

Similarly,  Red  chooses  a  point  s  =  («i,  s2)  in  a  set  S  which  is  the  convex 

set  spanned  by  the  curve 

Si  =  V, 

Si  =  y2, 


(  .  TT 

5  sm  -z  x 
K  £ 


3  cos 


I1)' 


0  <  y  <  1, 
0  <  y  <  1. 
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The  bilinear  payoff  becomes 

E(r,  s )  -  s2(ri  +  r2  —  1)  +  $Si(r2  —  3 n)  +  £(5ri  —  3r2) 

=  n(s2  -  4sx  +  1)  +  r2(s2  +  |.Si  —  1)  —  s2. 

In  Fig.  19,  the  convex  set  R  is  the  set  of  points  bounded  by  the  circular 
arc  ABC  and  the  line  ADC.  The  line  BD,  or  =  -fr2  —  4n  =  0,  divides 
the  space  R  into  three  regions — BD  where  k  =  0,  DAB  where  k  >  0,  and 
BCD  where  h  <  0. 


The  convex  set  S  is  the  set  of  points  bounded  by  the  parabolic  arc 
OMN  and  the  line  OQN.  The  line  VW,  or  mi  —  s2  —  4si  +  f  =  0,  the 
line  QT  or  m2  =  s2  +  |-si  —  1=  0,  and  the  line  LM  or  mi  —  m2  <  0, 
decompose  S  into  five  regions  as  follows: 


OQLW, 

where  mi  >  0, 

m2  <  0, 

excluding  mi  =  m2  =  0; 

QVL, 

mi  >  0, 

m2  >  0, 

mt  =  m2  =  0; 

LVNT, 

mi  <  0, 

m2  >  0, 

mi  =  m2  =  0; 

WLT, 

mi  <  0, 

m2  <  0, 

mi  =  m2; 

LM, 

mi  —  m2  <  0. 

The  set  R  is  decomposed  into  three  regions  by  means  of  h  =  0  and 
Z2  =  0,  namely : 

ABD,  where  li  >0,  l2  >  0,  excluding  h  =  h  =  0; 

BCD,  h  <0,  k  <  0,  h  =  k  =  0; 

D,  li  =  l2  =  0. 

For  every  point  r°  in  ABD,  E(r°,  s)  assumes  its  minimum  at  Si  =  s2  =  0. 
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However,  for  the  point  s°  =  (0,  0)  =  0,  E(r,  0)  assumes  its  maximum  at 
r  =  (1,  0)  which  is  outside  ABD.  Therefore,  no  point  in  ABD  can  be  an 
optimal  strategy  for  Blue. 

Now  consider  the  region  LVNT  of  S.  For  each  point  s°  of  this  region, 
E(r,  s°)  assumes  its  maximum  at  r2  =  1,  which  is  in  region  ABD.  But 
region  ABD,  we  have  seen,  maps  into  s  =  0  which  is  in  region  OQLW,  and 
not  in  LVNT.  Thus  no  point  of  LVNT  can  provide  an  optimal  strategy. 

We  can  summarize  the  complete  mapping  of  the  various  regions  of  the 
two  spaces  as  follows: 

In  Space  R  : 

every  point  of  ABD  maps  into  0  of  space  <S; 

each  point  of  BCD  maps  into  some  point  of  OWMN  of  S; 

point  D  maps  into  every  point  of  S. 

In  Space  S : 

every  point  of  OQLW  maps  into  C  of  space  72; 

each  point  of  QVL  maps  into  some  point  of  ABC  or  R; 

every  point  of  LVNT  maps  into  A  of  R) 

each  point  of  WLT  maps  into  either  A  or  C  of  R; 

every  point  of  LM  maps  into  every  point  of  ADC  or  R. 

To  find  the  solutions,  or  the  fixed  points,  it  is  necessary  to  combine  the 

two  mappings.  Letting - » stand  for  “map  into,”  we  can  combine  the 

above  mappings  as  follows: 

every  point  of  OQLW - >  C  — *  N; 

each  point  of  QVL - *  some  point  of  ABC - »  0,  OWMN ; 

every  point  of  LVNT - >  A - *  0; 

each  point  of  WLT - >  A  or  C - *  0  or  N; 

every  point  of  LM - *  every  point  of  ADC  =  AD  +  DC 

+  D - >0,N,S. 

It  is  seen  that  only  the  last  of  the  five  regions  yields  fixed  points.  For, 
in  this  case  we  have,  in  particular, 

LM  — »  D  — »  LM. 

The  solutions  of  the  game  are  therefore: 
for  Blue:  D:  ri  =  r2  =  f ; 

for  Red:  LM:  si  =  §,  s2  <  §. 

The  value  of  the  game  is  v  —  —  |. 

We  can  express  the  solutions  in  terms  of  cumulative  distribution  func¬ 
tions  as  follows: 

F*(x)  =  f  Ia(x)  +  \h{x), 

G*(y)  =  (l  -  -)h(y)  +  |  o  <  t  <  f. 
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Blue  has  a  unique  step-function  solution,  whereas  Red  has  an  infinite 
number  of  step-function  solutions. 


9.  LOCAL  DEFENSE  OF  TARGETS  OF  EQUAL  VALUE 

The  solution  to  the  problem  of  how  to  defend  targets  of  equal  value 
yields  results  which  are  not  immediately  evident.  In  defending  such  a 
system  of  targets,  it  is  sometimes  more  profitable,  depending  on  the  payoff 
function,  to  leave  some  targets  undefended  and  to  defend  others  heavily. 
Although  all  the  targets  may  have  the  same  value,  it  is  frequently  a  poor 
strategy  to  provide  the  same  defense  to  each  of  them.  In  the  model  we 
shall  study,  it  turns  out  that  if  the  defender  has  a  relatively  small  total 
force,  then  he  never  defends  all  targets  but  rather  defends  some  heavily 
and  leaves  the  remainder  unprotected. 

Case  1.  Indivisible  Defense. 

Suppose  the  Defender  has  D  units  of  defense  to  distribute  among  n  tar¬ 
gets  of  equal  vlaue.  Further  suppose  that  the  Defender  may  assign  either 
0,  1,  or  2  units  of  defense  to  each  target — i.e.,  the  defense  units  are  indi¬ 
visible,  and  further  that  D  <  2 n.  Assume  that  the  Attacker  has  a  total  of 
A  units,  where  A  <  2 n,  with  which  to  attack  each  target  with  either  0,  1, 
or  2  units. 

We  have  an  Attack-Defense  game  where  both  opponents  know  the 
parameters  n,  A,  and  D.  A  strategy  for  the  Attacker  is  the  choice  of  some 
number  t, 

°s‘Si i 

representing  the  fraction  of  the  targets  subjected  to  attack  by  two  units. 
Therefore,  the  fraction  of  targets  attacked  by  single  units  is  A/n  —  2 1. 
Of  course,  a  strategy  for  the  Attacker  chooses  only  the  number  of  targets 
to  be  attacked  and  leaves  the  selection  of  the  specific  targets  to  some 
random  device.  A  strategy  for  the  Defender  is  the  choice  of  some  number 
u,  where 

0  <  u  < 

2  n 

representing  the  fraction  of  targets  to  be  defended  by  two  units.  Therefore 
D/n  —  2 u  is  the  fraction  of  targets  having  single  defense  units. 

A  possible  payoff  to  the  Attacker  is  the  expected  number  of  targets 
on  which  at  least  one  bomb  will  be  dropped.  Now  for  each  target,  the 
expectation  of  its  being  destroyed  will  depend  upon  whether  it  is  attacked 
by  1  or  2  units  and  defended  by  0,  1,  or  2  units.  Thus  for  each  target  we 
have  the  payoffs  shown  in  Table  6. 
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Table  6.  Local  Defense  of  Targets 


(1) 

Number 

attacking 

units 


(2) 

Number 

defending 

units 


(3) 

Probability 
of  (1)  and  (2) 


+*) 

(f-20(?-2M) 

(» -  20u 
‘0  -»  +  “) 

'(» _  2“) 


(4) 

Foj/o#  if 
(1)  and  (2) 


(5) 

Expected 

Payoff 


(£-2‘)(1-»+“) 


(1-®  +  “) 
(“  -  2“) 


3/0 
4* 


The  payoff  associated  with  n  targets  will  be  given  by  n  times  the  sum 
of  column  (5)  or 


*«.  “)  =  "  [(t  -  2!)(>  -  £  +  “)  +  '('  -  f  +  “)  +  V (n  ~  2“)] 

—  2  n^u  +  ^  D  —  +  Am  +  A  — 


AD. 

n 


Since  M{t,  u)  is  bilinear  in  t  and  u,  it  has  a  saddle-point,  which  is 
given  by 

2/7  D 


.  2  A 

t  —  -  — >  U 

o  n 


a°-A 

5\4b  / 


t  =  0,  u  =  0, 
The  value  of  the  game  is 

=  4(„_d) 


D  4 

if  —  > 
n  7 

if  -  <  =■ 
n  7 


.fD  4 

n-  7 

if  2  4 


For  example,  if  the  Defender  has  100  defense  units  with  which  to 
defend  100  targets  of  equal  value  against  an  Attacker  having  a  total  of 
100  units,  an  optimal  strategy  for  the  Defender  is  to  place  two  defense 
units  at  each  of  30  targets,  one  unit  of  defense  at  each  of  40  targets,  and 
leave  30  targets  undefended.  An  optimal  strategy  for  the  Attacker  is  to 
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use  two  attack  units  on  each  of  40  targets,  one  attack  unit  on  each  of  20 
targets  and  no-attack  on  40  targets.  Of  course,  the  specific  targets  to  be 
defended  or  attacked  are  picked  at  random.  The  expected  number  of 
targets  destroyed  is  30. 

If  the  Defender  had  placed  one  unit  at  each  of  the  100  targets,  then  the 
Attacker  could  destroy  as  many  as  37.5  targets  by  attacking  each  of  50 
targets  with  two  units. 

Case  2.  Arbitrary  Defense. 

If  we  assume  that  the  Defender  is  able  to  distribute  his  forces  con¬ 
tinuously- — i.e.,  he  can  place  any  part  of  a  defense  unit  at  a  target — then 
the  local  defense  problem  becomes  an  infinite  game  with  a  separable  payoff 
function.  Again,  the  best  defense  strategy  may  require  a  heavy  defense 
for  some  targets  and  no  defense  for  others.  As  before,  the  Defender  has 
D  units  of  defense  with  which  to  defend  n  targets  of  equal  value,  where 
D  <  2 n.  He  is  able  to  distribute  continuously  these  D  units  among  the 
n  targets.  The  Attacker  has  A  units  of  attack,  where  A  <  2 n,  with  which 
to  attack  some  or  all  n  targets.  However,  the  Attacker  may  engage  any 
given  target  with  zero,  one,  or  two  units  only.  The  payoff  to  the  Attacker 
is  measured  by  the  expected  number  of  targets  destroyed. 

A  strategy  for  the  Defender  is  a  distribution  function  G(y)  representing 
the  fraction  of  his  targets  defended  by  no  more  than  -y  units  of  defense  at 
each  target.  Since  no  target  will  be  attacked  by  more  than  two  units,  it 
would  be  wasteful  to  have  y  larger  than  2. 

A  strategy  for  the  Attacker  is  some  number  t  where  u  =  t  <  A/2n, 
representing  the  fraction  of  targets  to  be  attacked  by  two  units.  The  se¬ 
lection  of  particular  targets  to  be  attacked  and  particular  targets  to  be 
defended  is  made  at  random  subject  to  the  given  fractions. 

The  payoff  to  the  Attacker  is  the  expected  number  of  targets  destroyed. 
The  Attacker  uses  2 In  units  of  attack  for  double  attacks  on  targets  and  the 
remainder,  A  —  2 tn  units,  for  single  attacks.  Now  consider  a  target  which 
is  defended  by  y  units  of  defense  and  is  under  double  attack.  The  prob¬ 
ability  that  the  target  is  destroyed  is  the  probability  that  the  two  attacking 
units  are  not  destroyed,  or 


Now,  the  probability  that  a  given  target  has  no  more  than  y  units  of 
defense  is  G(y) ;  it  follows  that  the  Attacker’s  a  'priori  expectation  of 
destroying  any  given  target  is 

a  -  £  (>  -  f)  <«<»>• 

For  the  2 tn  targets,  the  Attacker’s  expectation  for  his  double  attacks 
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will  be  2tnEu  Similarly,  the  Attacker’s  expectation  for  singly  attacked 
targets  is 

(A  -  2 nt)  JQl  (1  -  y)  dG(y) 

with  the  upper  limit  of  integration  extending  only  to  1,  since  no  payoff  is 
expected  for  targets  defended  by  more  than  one  unit.  Therefore,  the  total 
payoff  to  the  Attacker  corresponding  to  each  t  and  G  will  be 

(11.1)  M(t,  G)  =  2  nt  f*  (l  -  |-2)  dG(y)  +  (A  -  2  nt)  £  (1  -  y)  dG(y). 

The  Defender  must  distribute  all  D  units  of  defense  among  his  n  targets; 
hence  he  is  restricted  to  pick  only  those  G’s  for  which 

(11.2)  n  J*  ydG(y)  =  D. 

We  thus  have  a  game  in  which  the  maximizing  player  picks  some  num¬ 
ber  t  in  the  range  0  <  t  <  A/2n,  and  the  minimizing  player  picks  a  distri¬ 
bution  function  satisfying  (11.2),  where  the  payoff,  M{t,  G),  to  the  Attacker 
is  given  by  (11.1).  There  exist  optimal  strategies  for  the  two  players  which 
will  necessarily  depend  on  the  parameters  S  =  A/n  and  T  —  D/n,  average 
attack  per  target  and  average  defense  per  target.  The  solutions  are  ob¬ 
tained  by  analyzing  the  convex  set  spanned  by  the  curve  (m,  r2)  defined  by 

Mi  =  /02  (l  -  f-2)  dG(y)  -  2  /;  (1  -  y)  dG{y), 

M2  =  f*  y  dG(y), 

for  all  distribution  functions  G(y)  over  the  interval  (0,  2).  In  terms  of  this 
new  set  the  game  has  a  saddle-point  which  is  readily  obtained  by  the 
mapping  method. 

Table  7  presents  a  complete  summary  of  the  solutions  to  the  game  and 
their  values  for  all  possible  values  of  average  defense  per  target  and  average 
attack  per  target.  The  solutions  are  independent  of  the  number  of  targets, 
n,  but  depend  on  the  parameters  S  and  T.  One  can  easily  verify  the 
solutions  shown  in  the  table. 

In  region  I  defined  by  0  <  S  <  1  and  0  <  T  <  4  —  2V3,  the  De¬ 
fender  may  distribute  his  D  units  equally  among  the  n  targets,  giving  each 
target  D/n  units.  In  region  II  he  may  leave  some  targets  undefended  and 
defend  the  remaining  targets  equally.  In  region  III  the  Defender  must 
leave  some  targets  undefended  and  place  either  one  or  two  units  of  defense 
at  the  remaining  number  of  targets.  In  region  IV,  the  Defender  places  two 
units  at  each  of  the  D/2  targets  and  the  rest  necessarily  are  undefended. 
In  region  V  he  places  one  unit  at  each  of  D  targets  leaving  n  —  D 
undefended.  The  Attacker  has  a  pure  strategy. 


Table  7.  Local  Defense  of  Targets — Arbitrary  Defense 


Region  No. 

Range  of 

Optimal  strategies 

\ 

Value  of  game 

V 

=  A 

n 

T  =  — 
n 

Attack 

t* 

Defense  G* 

I 

0  to  1 

0  to  4  -  2V3 

0 

Any  G*  such  that  f  dG*(y)  =  1 

(1  -  T)S 

II 

0  to  1 

O 

ICO 

> 

<N 

1 

0 

Any  G*  such  that  dG*(y)  =  1,  J  y*  dG*{y)  =  8T  -  4 

(1  -  T)S 

III 

ot°i 

4 

7  to  2 

i* 

3(2  —  T)  T  ,4-2 T  T  .  7T  -  4  r 

10  /o  1  5  h  1  10  h 

|(2-0S 

IV 

lto2 

0  to  2 

S  -  1 

(‘-I>  +  §h. 

‘-f 

V 

ltoi 

0 

0 

s  -  1 

(1  -  T)J„  +  th 

4  —  7T  +  3  ST 

4 
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For  example,  if  the  Defender  has  100  targets  to  defend,  then,  depending 
on  the  numbers  of  defense  and  attack  units,  the  optimal  strategies  are 
given  by  Table  8. 


10.  SOLUTION  OF  POLYNOMIAL  GAMES 


If  the  payoff  is  a  polynomial  function  of  the  two  strategy  variables 
x  and  y,  we  call  this  separable  game  a  polynomial  game.  By  introducing 
moments  we  transform  the  polynomial  game  to  a  bilinear  game  over  a  con¬ 
vex  set,  usually  referred  to  as  the  moment  space.  The  transformed  game 
can  be  solved  by  mapping  and  determining  the  fixed  points. 

We  shall  describe  the  mapping  method  of  solving  polynomial  games  by 
illustrating  it  for  a  particular  game.  Suppose  the  payoff  to  Blue  is  given  by 


M( x,  y ) 

=  21x  +  18a;2  -  24a;3  -  16 y  -  36a :y  -  9x2y  +  18a;3y  +  60 y2  -  36 y\ 

where  0  <  x  <  1  and  0  <  y  <  1  are  pure  strategies  of  Blue  and  Red, 
respectively.  Let  F(x)  and  G(y)  be  mixed  strategies  of  Blue  and  Red. 
Define  the  following  moments : 


/<  =  JQlx'dF(x),  i=  1,2,3, 

gi  =  f*  V’  dG(y),  j  =  1,  2,  3. 


The  choice  of  a  mixed  strategy  F(x)  by  Blue  is  equivalent  to  his  choos¬ 
ing  a  point  /  =  (/i,  h,  fz)  in  a  convex  set  R  which  is  the  convex  hull  of  the 
curve  C  defined  by 

n  =  ad,  i  —  1)  2,  3, 


as  x  ranges  from  0  to  1.  Similarly,  each  mixed  strategy  G{y)  is  associated 
with  some  point  g  =  (gi,  32,  gz )  in  a  convex  set  S  which  is  the  convex  hull 
of  the  curve  D  defined  by 

Sj  =  y ’  j  =  1)  2,  3, 

as  y  ranges  from  0  to  1. 

In  terms  of  points  of  convex  sets  R  and  S,  the  payoff  becomes  the 
bilinear  form 


=  21/i  +  18/a  -  24/3  -  l&gi  -  36/if/i  -  9/231  +  I8/331  +  6032  ~  363s- 

The  convex  set  R  is  a  three-dimensional  volume  whose  boundary  con¬ 
sists  of  the  curve  C,  lines  connecting  the  point  (0,  0,  0)  with  each  point  of 
C,  and  lines  connecting  the  point  (1,  1,  1)  with  C.  There  are  no  planes 
which  form  the  boundary  of  R.  Of  course,  the  boundary  of  S  is  the  same  as 
the  boundary  of  R. 

We  now  have  the  bilinear  game  in  w'hicli  Blue  picks  some  point  (/ 1, 


Table  8.  Local  Defense  of  Targets — Solution  for  100  Targets 


Number 
attack 
units  A 

Number 
defense 
units  D 

Optimal  strategies: 

n  =  100 

Value  of 
game 

Number 

targets 

destroyed 

No.  Targets  Attacked  by 

Target  Defense 

2  Units 

1  Unit 

0  Units 

No. 

Targ. 

No. 

per 

Targ. 

No. 

Targ. 

No. 

per 

Targ. 

No. 

Targ. 

No. 

per 

Targ. 

No. 

Targ. 

No. 

per 

Targ. 

100 

50 

0 

50 

100 

.50 

0 

0 

0 

0 

0 

0 

50 

100 

55 

0 

55 

45 

76 

.73 

24 

0 

0 

1 

0 

2 

45 

100 

100 

40 

20 

0 

30 

0 

40 

1 

30 

2 

30 

175 

100 

75 

25 

0 

50 

0 

0 

ffKB! 

2 

50 

150 

50 

50 

50 

0 

50 

0 

50 

H 

0 

2 

31 
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in  R  and  Red  picks  some  point  (g i,  g2,  gs)  in  S.  Let  us  rewrite  the  payoff 
as  follows: 

M(J,  g) 

=  —  16<7i  +  60^2  -  36g3  +  3/i(7  -  12gi)  +  9/2(2  -  gi)  +  6/3(-4  +  3<h). 

The  maximum  value  of  a  linear  form  over  a  convex  set  is  assumed  on 
the  boundary  of  the  convex  set.  Therefore,  for  any  point  (gfi,  g-i,  <73)  chosen 
by  Red,  Blue,  who  wishes  to  maximize  M  (/,  g),  will  pick  some  point  on  C 
or  some  line  joining  (0,  0,  0)  to  C,  or  some  line  joining  (1,  1,  1)  to  C.  We 
shall  first  determine  when  Blue  selects  each  of  these  three  types  of 
boundaries. 

In  order  for  Blue  to  select  some  line  joining  (0,  0,  0)  to  a  point  on  C, 
it  is  necessary  that 

M{ 0,  g)  =  M(f°,  g), 

where  f°  is  on  C  or  /?  =  x,  f2  =  x2,  ft  =  xz.  It  is  also  necessary  that 

dM(f\ g) 
dx 


and 


dWCM<n 

dx2 


The  first  two  conditions  yield  the  equations 

3x(7  -  12?0  +  9x2(2  -  flfO  +  6x3(-4  +  3gr,)  =  0, 

3(7  -  12j0  +  18a:(2  -  gi)  +  18x2(-4  +  Zgi)  =  0. 

Solving  these  equations,  we  obtain 

x  =  i,  gi  —  f- 

These  values  satisfy  the  necessary  conditions  for  a  maximum.  Therefore, 
if  Red  chooses  a  mixed  strategy  for  which  <71  =  §,  Blue  can  maximize 
MU,  9)  by  choosing  any  point  in  R  along  the  line  joining  (0,  0,  0)  to 
(i,  i  i).  From  this  it  follows  that  if  Red  chooses  gi  >  f,  then  Blue  will 
choose  the  point  (0,  0,  0)  in  R,  and  if  Red  chooses  gi  <  f,  Blue  will  choose 
a  point  (x,  x2,  x:i)  in  R  satisfying  the  equation 


dMU,  g)  =  3^7  _  l2gj  +  18a.^2  _  gi)  +  18x2(  — 4  +  3 gj  =  0 


or  _ _ 

_  -3(2  -  gi)  -  V3(75 a2  -  150(7!  +  68) 

^  6(3<7i  -  4) 

We  have  partitioned  the  space  S.  We  now  need  to  partition  the  space 
R.  Again,  we  make  use  of  the  fact  that  the  minimum  of  a  linear  form  over 
a  convex  set  is  assumed  on  the  boundary.  Therefore,  for  any  point  ( fi,  /2,  /s) 
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chosen  by  Blue,  the  minimum  value  of  M (/,  g)  will  be  assumed  either  on  D, 
or  on  some  line  joining  (0,  0,  0)  in  S  to  D,  or  on  some  line  joining  (1,  1,  1) 
in  S  to  D. 

Let  us  rewrite  the  payoff  function  as  follows: 

M(f,  g) 

=  21/!  +  18/2  -  24/3  +  f?i(  — 16  -  36/i  -  9/2  +  18/3)  +  60g2  -  36g3. 
We  can  show  that  the  coefficient  of  gi  satisfies  the  inequality 
-43  <  -16  -  36/i  -  9/2  +  18/3  <  -16. 

The  payoff  function  M(f,  g)  will  assume  its  minimum  value  along  a  line 
joining  the  point  (1,  1,  1)  in  S  to  D  for  all  points  in  R  satisfying  the  three 
conditions: 

(i)  M{f,  1)  =  MU,  0°), 

where  g°  is  on  D  or  <7®  =  y,  g2  =  y2,  gi  =  y3, 


(ii) 

dM(f,  ff°)  n 
dy 

(iii) 

0 

Al 

% 

5  “a. 

<N 

The  first  two  conditions  yield  the  equations 
( — 16  —  36/i  —  9/2  +  18/3)  +  60  36 

=  y(— 16  —  36/i  —  9/2  +  18/3)  +  60  y2  —  36  y3, 
(-16  -  36/i  -  9/2  +  I8/3)  +  120y  -  108g2  =  0. 

Solving,  we  obtain 

-16  -  36/i  -  9/2  +  18/s  =  -28  and  y  =  i 
These  values  satisfy  the  conditions  for  a  minimum.  Therefore,  for  all  points 
in  R  for  which 

-16  -  36/i  -  9/2  +  18/s  =  -28, 

the  minimum  value  of  M{f ,  g)  is  assumed  at  each  point  of  the  line  joining 
(1,  1,  1)  to  (|,  a  it).  It  follows  that  for  all  points  in  R  for  which 

-16  -  36/i  -  9/2  +  18/s  <  -28, 

the  minimum  of  M(f,  g)  is  assumed  at  (1,  1,  1)  in  S,  and  for  all  points  in 
R  for  which 

-16  -  36/i  -  9/2  +  18/s  >  -  28, 

the  minimum  of  M(f,  ff)  is  assumed  at  some  point  (y,  y2,  y3)  in  S  satisfying 
the  equation 
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d-^/y  g)  =  -16  -  36/i  -  9/2  +  I8/3  +  120 y  -  1(%2  =  0, 

or 

=  10  -  V 100  +  3(-16  -  36/i  -  9/2  +  18/,) 
y  18 

Table  9  presents  a  summary  of  the  partitioning  and  mappings  of  spaces 


Tafcle  9 

M{x,  y )  =  21x  +  18z2  —  2ix3  —  16?/  —  36 xy  —  9 x2?/  +  18 x3y  +  60?/2  —  36 y3 


fl-SPACE 

Points  in  R  for  which 

Map  into  points  in  S 
(1 location  in  S  of  min  (/,  g)) 

li  =  — 16  —  36/,  -  9 f3  +  18/3  <  -28 

*1  =  -17  -  36/i  -  9/2  +  I8/3  >  -28 

h  =  -16  -  36/,  -  9/2  +  I8/3  =  -28 

(b  1,  1) 

10  -  V100  +  3/, 
y~  18 

P(h  h  if)  +  (1  -  |3)(1,  1,  1) 

where  0  <  0  <  1 

&-SPACE 

Points  in  S  for  which 

Map  into  points  in  R 
(, location  in  R  of  max  M(J,  g)) 

9i  >  I 

9i  <  f 

Bi  =  I 

(0, 0,  0) 

_  _  -3(2  -  gf)  -  V3 (7bg\  -  1500,  +  68) 
6(30,  -  4) 

«(b  b  i)  +  (1  -  a)(0,  0,  0) 

R  and  S.  The  solution  of  the  game,  the  optimal  strategies,  are  the  fixed 
points  in  the  mappings  listed  in  the  table.  We  test  each  of  the  three  regions 
in  each  space  for  the  existence  of  fixed  points.  From  the  table  we  note 
that  if  h  <  —28  then  M (/,  g)  assumes  its  minimum  at  (1,  1,  1)  in  S,  or 
fit  =  h  Again  from  the  table  we  note  that  if  gx  —  1  >  §,  then  M  (/,  g) 
assumes  its  maximum  at  (0,  0,  0)  in  S  or  k  =  16;  hence  h  >  -28.  There¬ 
fore  no  point  in  R  for  which  h  <  —  28  can  provide  a  solution  to  the  game. 
In  a  similar  manner,  we  can  show  that  k  >  —28  and  fifi  <  |  contain  no 
fixed  points,  and  therefore  no  solutions  of  the  game. 

Since  we  have  eliminated  two  of  the  three  regions,  it  follows  that 
7i  =  —28  and  </i  =  §  must  contain  the  fixed  points.  It  remains  to  solve 
for  the  values  of  a  and  (i.  For  the  third  region  to  contain  a  fixed  point, 
we  must  have 
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The  fixed  points  are  therefore  (|,  L  rtj)  in  R  and  (f ,  f,  tt)  in  S.  If  Blue 
chooses  a  strategy  such  that  /i  =  f,  h  —  b  h  —  t£>  then  Ibid  can  m"‘'' 
mize  M{f,  9 )  by  choosing  g,  =  f,  <72  =  I,  9z  —  If-  If  Ibid  chooses  the  pre¬ 
ceding  set  of  fir’s,  then  Blue  can  maximize  M(J,g)  by  choosing  /1  =  3, 
f2  =  1 J3  =  I!?.  This  verifies  that  these  points  are  fixed  points  and  therefore 
a  solution  of  the  game.  The  solution  can  be  expressed  by 

F*(x)  =  lh{x)  +  §/i/2(x), 

G*(y)  =  UUv)  +  ¥1  (y)- 

The  value  of  the  game  is  given  by 

v  =  fQl  M{x,  y)  dF*(x)  dG*(y)  =  6. 


11.  TACTICAL  RECONNAISSANCE— SINGLE  MISSION 

In  planning  an  attack  on  a  target  whose  military  worth  is  not  known 
exactly,  it  may  be  advisable  to  invest  in  reconnaissance  before  the  mission 
to  determine  the  exact  worth  of  the  target.  The  uncertainty  regarding  the 
worth  of  the  target  may  arise  from  unknown  or  partially  known  results  of 
earlier  strikes  on  the  same  target.  If  the  exact  worth  of  the  target  is  dis¬ 
covered  through  reconnaissance,  then  it  is  possible  to  dispatch  the  most 
efficient  size  attacking  force  against  it— i.e.,  a  large  force  will  not  be  com¬ 
mitted  against  a  worthless  target  and  a  small  force  will  not  be  sent  to  a 
valuable  target.  Since  reconnaissance  is  costly,  it  is  necessary  to  evaluate 
its  desirability  against  the  potential  outcomes  of  the  attack. 

A  successful  reconnaissance  is  assumed  to  reveal  the  exact  value  of  the 
target.  In  order  to  be  successful,  at  least  one  reconnaissance  aircraft  must 
fly  to  the  target  and  return. 

Let  us  introduce  the  following  notation: 

B  =  military  worth  of  one  bomber. 

R  =  military  worth  of  one  reconnaissance  aircraft. 

T  =  military  worth  of  the  target. 

<f)(t)  —  probability  that  the  value  of  the  target  does  not  exceed  t.  This 
probability  distribution  is  known  prior  to  the  reconnaissance. 
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r  =  number  of  reconnaissance  aircraft  sent  out  prior  to  the  mission. 
b  =  number  of  bombers  dispatched  to  the  target  during  mission. 
P  ~  one-way  survival  probability  of  bomber  and  reconnaissance 
aircraft  between  base  and  target. 

aT  =  probable  worth  of  the  target  after  being  hit  by  one  bomber. 
a2T  =  probable  worth  of  the  target  after  being  hit  by  two  bombers. 

The  object  of  the  attacker  is  to  maximize  the  net  outcome  of  the  mission, 
namely  the  difference  between  the  target  damage  and  the  aircraft  losses. 
Thus  the  payoff  will  depend  on  r  and  b  and  is  given  by 

M(r,  b)  =  j  [>(1  -  «p*)  -  (1  -  p2)Bb  -  (1  -  p*)Rr] 

where  the  first  term  under  the  integral  represents  the  target  damage  and 
the  remaining  two  terms  are  the  worths  of  the  bomber  and  aircraft  losses, 
respectively. 

Using  the  information  available,  if  any,  from  reconnaissance  and 
maximizing  on  b,  we  obtain 

max  M(r,  b)  =  M(r)  =  /{(l  -  p2)'  max  J  Gib,  t)  d<t>(l) 

+  [1  -  (1  ~  P2)r]  max  C(b,  t)  -  (1  -  p2)l?rj  d<t>(t), 
where  C(b,  t)  is  defined  by 

C(b,  t)  =  <(1  -  « *>)  -  (1  -  p*)Bb. 

Finally  the  solution  is  obtained  by  maximizing  M(r)  on  r. 

It  is  convenient  to  give  the  solution  in  terms  of  the  following  constants: 
P  =  -In  (1  -  p2), 

n  =  (1  ~  P2)B 

p  In  a 

<£i  =  J  t  d<t>(t), 

A  =  D  j  In  ^  d<t>(t). 

Further,  we  impose  two  conditions  on  these  constants: 

(i)  0(D)  =  0, 

00  AP  -  (1  -  p2)R  >  0. 

lhe  first  condition  states  that  there  is  no  chance  that  the  reconnaissance 
report  will  indicate  that  no  mission  should  be  run  at  all.  The  second  con¬ 
dition  states  that  there  is  sufficient  doubt  concerning  the  state  of  the  target 
to  make  reconnaissance  worthwhile.  If  the  second  condition  fails,  then  no 
reconnaissance  will  be  sent  out. 
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Maximizing  M(r,  b )  on  b  and  r,  we  obtain  as  the  optimal  sizes,  r*,  b*, 
of  reconnaissance  aircraft  and  bombers,  respectively,  to  be  dispatched: 


b* 


In 


(I)  , 


*  i  .  1  ,  AP 

r*  —  1  +  -  m  — • 
p  R 


—  p  In  a 


if  reconnaissance  reports  T 


In 


/4n\ 

\P)  . 


-p  In  a 


if  reconnaissance  does  not  report. 


For  these  optimal  values  of  r  and  6  we  have  the  payoff 

M{r*,  b*)  =  fc  -  D  -  D  In  ^  ^  [AP  -  (1  -  p2)«]  -  (1  -  p2)Rr*. 

If  no  reconnaissance  were  sent,  then  the  best  result  would  have  been 


M(b*)  =  4.1  -  D  -  D  In 


<tn 

D 


Example.  Suppose  the  military  worth  of  a  bomber  is  the  same  as  a 
reconnaissance  aircraft,  each  equal  to  one  unit.  Prior  to  reconnaissance, 
all  that  is  known  about  the  target  is  that  it  is  just  as  likely  to  have  a 
value  of  50  units  as  5  units.  Suppose  each  bomber  can  destroy  10  per  cent 
of  the  worth  of  a  target.  Assume  that  the  one-way  survival  probability  of 
each  aircraft  is  0.8. 

We  have  then  the  following  given  constants: 


B  =  R  =  1, 

=  ih(i)  +  hEo(t), 

a  =  0.9, 


p  =  0.8. 


Substituting 

constants: 


the  above  constants,  we  obtain  the  following  additional 

P  =  -  In  (1  -  0.64)  =  1.022, 

(1-  0.64)1 

D~  0.8  In  0.9  ~  4-275’ 

4>1  =  f050  t  [|  dm  +  |  d/M(o]  =  27.5, 

A  -  4.275  /  In  [|  dh(t)  +  |  d/60(f)]  =  2.366, 


<t>(D )  =  <£(4.275)  =  0, 

AP  =  (1  —  P2)P  =  2.058  >  0. 
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The  optimal  numbers  of  reconnaissance  aircraft  and  bombers  are 
given  by: 


-0.8  In  0.9 


1.86 


-0.8  In  0.9 


29.20 


l  -0.8  In  0.9 


22.10 


if  reconnaissance  reports  5 


if  reconnaissance  reports  50 


if  reconnaissance  does  not  report. 


Using  these  optimal  strategies,  the  attacker  receives  a  payoff 

M(r*,b*)  =  16.61. 

If  no  reconnaissance  had  been  attempted,  the  payoff  would  have  been 

M(b*)  =  15.27. 

We  may  interpret  the  solution  as  follows:  Two  reconnaissance  aircraft 
are  to  be  sent  out.  The  expected  loss  is  2(1  —  p2)  =  0.72  aircraft.  If  the 
reconnaissance  reports  that  the  target  is  worth  50  units,  then  29  bombers 
are  dispatched,  causing  50(1  —  0.9C29)(0-8))  =  45.67  units  of  damage,  with 
an  expected  loss  of  29(1  —  0.64)  =  10.44  bombers.  If  the  reconnaissance 
reports  the  target  to  be  worth  five  units,  then  two  bombers  are  sent  out 
which  destroy  0.77  units  of  the  target  and  0.72  bombers  are  expected  to  be 
lost.  If  the  reconnaissance  fails  to  report,  then  21  bombers  are  dispatched, 
and  the  damage  they  inflict  is  either  42.15  or  4.22  units,  or  an  expected  23.18 
units,  with  an  expected  loss  of  7.92  bombers.  The  probability  that  the 
reconnaissance  will  fail  to  report  is  (1  —  p2)2  =  0.13.  The  probability  that 
the  reconnaissance  will  report  50  is  the  same  as  the  probability  that  the 

1  -  (1  -  p2)2 


reconnaissance  will  report  5  and  is  given  by 


0.435. 
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also  Convex  payoff  functions 
Constraints,  games  with,  88-90 
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duels  see  Duels 
examples  of,  3-10 
in  extensive  form,  74-78 
field  commanders,  functions  of,  3-5 
guessing  game,  32-35 
infinite  see  Continuous  games 
infinite  games  with  separable  payoff 
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perfect  information,  14-15,  75,  76-78 
poker,  9-10 

polynomial,  170-175  see  also  Separable 
games 
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Mixed  strategies: 
description  of,  18-21 
and  distribution  functions,  98-100 
graphical  representation  of,  20-21 
optimal,  22-23 


INDEX  183 


Morra: 

description  of,  5 
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addition  and  multiplication  by  con¬ 
stant,  39 

“all  strategies  active,”  42-43 
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application  of  structure  theorems  to, 
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mapping  method  for  solving,  92-96 
tactical,  175-178  see  also  Separable 
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Representation  of  games,  74-76 
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description  of,  12-14 
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theorem 
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indivisible  defense,  165-167 
moments,  158 

number  of  steps  in  step-function  solu¬ 
tion,  161 
solution,  161-165 

solution  of  polynomial  games,  170-175 
tactical  reconnaissance,  175-178 
Silent  Duel,  Monotonic  Accuracies  (ta¬ 
ble),  142 

Silent  duels  see  Duels,  silent 
Simplified  poker,  9-10 
Solution  of  Polynomial  Games  (table), 
174 

Solution  of  Reconnaissance  Games  by 
Mapping  Methods  (table),  94-95 
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successive  approximations,  82-85 
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Stieltjes  integral,  100-106 
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with  respect  to  step-functions,  104-105 
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concept  of,  2-3 
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Successive  approximations,  as  method  of 
solution,  82-85 
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